Appln. No.: 09/853,880 
Amendment dated July 14, 2003 
Reply to Office Action of March 27, 2003 

The Rejection of Claims 13 and 15-22 under 35 U.S.C. §112, second paragraph 

Claims 13 and 15-22 stand rejected as indefinite. Applicant respectfully traverses. 

The term "GPNMB" is objected to as having multiple meanings and being a mere 
laboratory term. However, GPNMB is actually a term of art. This is the designation which those 
of skill in the art have given to a particular gene. The specification at page 26, Table 2, last line, 
describes the gene with a gene name (Glycoprotein (transmembrane) nmb), a gene symbol 
(GPNMB), a cellular location (membrane), a GenBank number (X76534), primer pairs for 
amplification of the gene (SEQ ID NO: 15 and 16), and a literature citation (Weterman, 1995, 
Int. J. Cancer 60: 73-81). 

If one uses the GenBank number, one finds a particular amino acid sequence for the protein: 

MECLYYFLGFLLLAARLPLDAAKRFHDVLGNERPSAYMREHNQLNGWSSDENDWNEK 
LYPVWKRGDMRWKNSWKGGRVQAVLTSDSPALVGSNITFAVNLIFPRCQKEDANGNI 
VYEKNCRNEAGLSADPYVYNWTAWSEDSDGENGTGQSHHNVFPDGKPFPHHPGWRR 
WNFIYVFHTLGQYFQKLGRCSVRVSVNTANVTLGPQLMEVTVYRRHGRSTINYKWSFG 
DNTGLFVSTNHTVNHTYVLNGTFSLNLTVKAAAPGPCPPPPPPPRPSKPTPSLGPAGDNP 
LELSRIPDENCQINRYGHFQATITIVEGILEVNIIQMTDVLMPVPWPESSLIDFVVTCQGSIP 
TEVCTIISDPTCEITQNTVCSPVDVDEMCLLTVRRTFNGSGTYCVNLTLGDDTSLALTSTL 
ISVPDRDPASPLRMANSALISVGCLAIFVTVISLLVYKKHKEYNPIENSPGNVVRSKGLSV 
FLNRAKAVFFPGNQEKDPLLKNQEFKGVS. See Tab A. 

Thus the term GPNMB is neither indefinite nor a laboratory term. It has a specific 
meaning in the art which one of ordinary skill in the art would know, given the teachings of the 
specification. Although applicants do not believe it to be necessary, claim 13 has been amended 
to recite the longer name "Glycoprotein (transmembrane) nmb." 

The term "contacting" in the phrase "contacting cells with an antibody" of claim 13 is 
said to be unclear. The Patent Office asks whether applicant intends "for the compounds to be 
delivered by placing the antibody in a cell of a glioblastoma."^1 The step of contacting does not 
require that an antibody be placed in a cell. It requires that the antibody be placed in contact 
with a cell. If the cell is in culture, then the antibody is placed in the culture medium. If the cell 
is in an animal, then the antibody is delivered into the animal so that the antibody can contact the 
cell. 

1 If the question arose due to the missing preposition in the claim as filed, that missing preposition ("with") has now 
been inserted by amendment. 

The plain meaning of the word "contacting" is intended and there is no indefiniteness. 
The dictionary^2 defines the verb "to contact" as "to bring or put into contact." The noun 
"contact" is defined as "the coming together or touching of two objects or surfaces." These 
meanings are what applicant intends, as discussed with particularity above with regard to cells 
and antibodies. 

The term "extracellular epitope" is also said to be indefinite because the metes and 
bounds cannot be defined. The Office Action explains that any combination of three or more 
amino acids found within the extracellular domain of GPNMB can be considered an epitope. 
Even if the definition of an extracellular epitope is accurately provided by the Office Action, it 
does not follow that the term is indefinite. The extracellular domain of GPNMB is a known 
domain of a known protein. The claim simply requires that the antibody which is used be 
capable of binding to the extracellular domain. The claim is clear and definite as written. 

The Rejection of Claims 13 and 15-22 Under 35 U.S.C. §112, first paragraph 

Claims 13 and 15-22 are rejected as not enabled by the specification. The specification 
allegedly does not enable any person skilled in the art to practice the invention commensurate in 
scope with these claims. This rejection is respectfully traversed. 

The Office Action provides its rationales for the rejection at pages 4 and 5. They are 
quoted below and addressed individually. 

• "Nowhere in the specification does it specifically teach the delivery of an agent to a 
specific epitope of GPNMB, because the GPNMB protein has never actually been defined 
in the specification. There is little information of the protein itself." 

The Office Action urges that the protein is not defined. However, as pointed out above, 
the GPNMB protein was known in the art, and the specification references its complete amino 
acid sequence in GenBank. It is not clear what additional information about the protein might be 
necessary to practice the invention. 

2 The American Heritage Dictionary, second college edition, Houghton Mifflin Company, Boston, 1982. 

Applicants have applied SMART (Simple Modular Architecture Research Tool) 
to automatically identify the extracellular domain of GPNMB. SMART is made available to the 
public by the European Molecular Biology Laboratory on the Internet. As shown at Tab B, 
amino acids 1 to 483 have a >0.98 probability of being extracellular. Thus one of ordinary skill 
in the art would have no trouble identifying the extracellular domain or making antibodies which 
specifically bind to it. 

• "The specification does not teach whether the protein even actually exists." 

The specification implicitly teaches that the protein exists. The specification teaches 
throughout the use of antibodies specific for the protein. The specification teaches throughout 
the detection of elevated expression levels using Western blotting, enzyme-linked 
immunosorbent assay, radioimmunoassay, and fluorescence activated cell sorting. If the protein 
did not exist, then these teachings would be for nought. 

Applicants may not, arguendo, have proven that the protein exists. However, applicants 
are not obliged to prove such things. The Patent Office is obliged to assume the truth of the 
specification's teachings unless it has good reason to doubt it. See MPEP §2107.02, which 
quotes from In re Marzocchi, 439 F.2d 220, 223, 169 U.S.P.Q. 367, 369 (C.C.P.A. 1971) as 
follows: 

[A] specification disclosure which contains a teaching of 
the manner and process of making and using the invention in 
terms which correspond in scope to those used in describing 
and defining the subject matter sought to be patented must be 
taken as in compliance with the enabling requirement of the 
first paragraph of § 112 unless there is reason to doubt the 
objective truth of the statements contained therein which must 
be relied on for enabling support (emphasis added). 

Thus, the specification's alleged failure to explicitly teach that the protein exists cannot 
render the specification not enabling. 

• "It is a well established fact that in normal circumstances, the level of transcription 
parallels that of translation, however, in certain circumstances, the level of mRNA 
transcription is uncoupled from protein translation. Therefore, given the fact that it is 
possible for some transcription and translation events to be unparalleled, it is possible 
GPNMB may not be translated from the unregulated mRNA detected in the instant 
invention." (citations omitted) 

The Office Action speculates that GPNMB protein translation may be uncoupled from 
GPNMB mRNA transcription. This speculation is contrary to what the Office Action itself sets 
out as the "established fact in normal circumstances." There is no factual or evidentiary basis 
provided by the Office Action for asserting that GPNMB departs from the normal circumstance. 
However, as stated in the MPEP, there must be a "reason to doubt the objective truth of the 
statements contained therein." Mere speculation about a possibility is not a reason to doubt. 
Thus this rationale cannot properly serve as a basis for rejecting the claims as not enabled. 

• "How would one of skill in the art know whether there are even enough epitopes present 
on the surface of a glioblastoma for the method to be effective in delivering an agent." 

The Office Action speculates that one requires a certain number of epitopes to make 
delivery effective. However, the specification teaches that the GPNMB protein is highly 
expressed in glioblastoma. See, e.g., Figure 3. Moreover, the amino acid sequence of GPNMB 
protein indicates that 483/560 residues are likely to be extracellular (i.e., 86% of the protein). 
See Tab B. Thus, the record evidence suggests that there will be sufficient extracellular epitopes 
on the surface of glioblastoma cells. The imputation to the contrary has no factual or scientific 
basis. 
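
By way of illustration only, the percentage cited above follows from simple arithmetic on the residue counts reported at Tab B. The following is a minimal sketch in Python; the variable names are illustrative and are not drawn from the record.

```python
# Residue counts as stated in the text above (see Tab B).
extracellular_residues = 483  # residues predicted to be extracellular
total_residues = 560          # full length of the GPNMB protein

# Share of the protein predicted to be extracellular, in percent.
percent = 100 * extracellular_residues / total_residues
print(f"{percent:.2f}% of residues are predicted to be extracellular")
```

The computed value is 86.25%, which the text above states as 86%.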

• "If there is only a small percentage of protein found on the surface of the glioblastoma 
cell, then the antibody conjugate that is administered is left circulating in the blood 
stream thereby eliciting anti-idiotypic antibodies." 

The Office Action speculates that administration of an antibody-conjugate may cause a 
presumably undesirable side effect, elicitation of anti-idiotypic antibodies. The Office Action 
does not explain how that would render the claimed method unworkable. Nonetheless, this is 
again mere speculation, without the slightest bit of evidentiary support or scientific basis. 

• "Because the working examples of the instant invention have only provided a method of 
screening, amplification, hybridization, and other general methods of probing, it is not 
commensurate in scope to the claims that read on methods of delivering a reagent using 
an antibody to GPNMB." 

The Office Action makes the false equation that the scope of the working examples must 
be commensurate with the scope of the claims. In fact, the law requires that the scope of the 
specification's teachings as a whole must be commensurate with the scope of the claims ("the 
only relevant concern should be whether the scope of enablement provided to one skilled in the 
art by the disclosure is commensurate with the scope of protection sought by the claims." MPEP 
§2164.08). The absence of working examples is not determinative of non-enablement. "How a 
teaching is set forth, by specific example or broad terminology, is not important." MPEP 
§2164.08. 

The Patent Office has not met its burden in showing that the specification does not 
provide enablement that is commensurate with the scope of the claims. The burden is on the 
Patent Office to "back up assertions of its own with acceptable evidence or reasoning which is 
inconsistent with the contested statement [of the application]." See MPEP § 2164.04. The 
Patent Office has simply used speculation without ever providing evidence inconsistent with the 
teachings of the specification. Withdrawal of this rejection is requested because the Patent 
Office has not made a prima facie case of non-enablement. 

The Rejection of Claims 13, 15, 18-20 and 22 under 35 U.S.C. §102(b) 

Claims 13, 15, 18-20, and 22 stand rejected as anticipated by Johnson (U.S. 5,352,447). 
This rejection is respectfully traversed. 

Johnson is cited as teaching use of an antibody that specifically recognizes the transferrin 
receptor. 

Claims 13, 15, 18-20 and 22 are directed to use of an antibody that specifically binds to 
an extracellular epitope of GPNMB. 

GPNMB and transferrin receptor are two distinct proteins. The Office Action provides no 
evidence whatsoever that they are the same protein or even that they are similar proteins. In the 
absence of any such evidence, there is no reasonable basis on which to conclude that these 
proteins share epitopes. There is no reasonable basis to conclude that antibodies specific for 
transferrin receptor would specifically bind to GPNMB. The PTO has failed to meet its burden 
in presenting a prima facie case of anticipation, inherency, or equivalence. 

Current estimates of the complexity of the human genome are in the range of 40,000 
genes. The assumption of a one-to-one correspondence of genes-to-proteins would lead to an 
estimate of 40,000 proteins in the human proteome. There is no reason to believe that any two of 
40,000 proteins selected at random would be similar in sequence. 

The PTO justifies its conclusion of equivalence of the transferrin receptor and GPNMB 
on the basis of the failure of applicants to provide the sequence of GPNMB. However, as 
discussed above, the sequence of GPNMB is known in the art and is cited in the specification. 
No sequence for transferrin receptor is found in the cited Johnson patent. Nonetheless, 
applicants have compared the sequences of human transferrin receptor proteins 1 (Tab C) and 2 
(Tab D) to the sequence of GPNMB (Tab A). The NCBI's Blast 2 Sequences tool was used. No 
significant similarity was found. See Tabs E and F. 

The Rejection of Claims 13, 15, and 18-21 under 35 U.S.C. §103(a) 

Claims 13, 15, and 18-21 stand rejected as unpatentable over Lazarus in view of either 
Reiter or Maynard. This rejection is respectfully traversed. 

Lazarus, the primary reference, is cited as teaching the assessment of anti-NMB 
antibodies. The PTO asserts that NMB of Lazarus is also known as GPNMB. This, however, is 
a false assertion. Lazarus teaches that Neuromedin B is a decapeptide, i.e., a ten-amino acid 
residue peptide. See page 161, column 1, first sentence. The sequence of neuromedin B is 
provided in Lazarus' Table 1, line 1: GNLWATGHFM-NH2. This sequence does not appear in 
the sequence of GPNMB (NP_002501). Thus the basis for the rejection must collapse, because 
NMB and GPNMB are not the same protein, contrary to the PTO's assertion. 
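
For illustration, the statement that the decapeptide sequence does not appear in GPNMB can be checked with a plain substring search. The sketch below is in Python and makes two assumptions: the GPNMB sequence is the CDS translation as printed in the GenBank record at Tab A, and the function and constant names are hypothetical, chosen here for readability. It is not the method applicants used for the BLAST comparisons.

```python
# GPNMB protein sequence, transcribed from the CDS translation at Tab A.
# The string is split across lines only for readability.
GPNMB = (
    "MECLYYFLGFLLLAARLPLDAAKRFHDVLGNERPSAYMREHNQL"
    "NGWSSDENDWNEKLYPVWKRGDMRWKNSWKGGRVQAVLTSDSPALVGSNITFAVNLIF"
    "PRCQKEDANGNIVYEKNCRNEAGLSADPYVYNWTAWSEDSDGENGTGQSHHNVFPDGK"
    "PFPHHPGWRRWNFIYVFHTLGQYFQKLGRCSVRVSVNTANVTLGPQLMEVTVYRRHGR"
    "AYVPIAQVKDVYVVTDQIPVFVTMFQKNDRNSSDETFLKDLPIMFDVLIHDPSHFLNY"
    "STINYKWSFGDNTGLFVSTNHTVNHTYVLNGTFSLNLTVKAAAPGPCPPPPPPPRPSK"
    "PTPSLGPAGDNPLELSRIPDENCQINRYGHFQATITIVEGILEVNIIQMTDVLMPVPW"
    "PESSLIDFVVTCQGSIPTEVCTIISDPTCEITQNTVCSPVDVDEMCLLTVRRTFNGSG"
    "TYCVNLTLGDDTSLALTSTLISVPDRDPASPLRMANSALISVGCLAIFVTVISLLVYK"
    "KHKEYNPIENSPGNVVRSKGLSVFLNRAKAVFFPGNQEKDPLLKNQEFKGVS"
)

# Neuromedin B decapeptide as given in Lazarus' Table 1, line 1.
# The "-NH2" suffix marks C-terminal amidation and is not a residue.
NEUROMEDIN_B = "GNLWATGHFM"


def contains_peptide(protein: str, peptide: str) -> bool:
    """Return True if `peptide` occurs as a contiguous run within `protein`."""
    return peptide in protein


print(len(GPNMB))                             # number of residues transcribed
print(contains_peptide(GPNMB, NEUROMEDIN_B))  # is the decapeptide present?
```

An exact substring search only shows that the ten residues never occur contiguously; it says nothing about weaker similarity, which is why the BLAST comparisons at Tabs I and J are also relied upon.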

Lazarus' neuromedin B decapeptide is produced in vivo from a 121 amino acid-residue 
protein. See NP_066563, particularly residues 47-56 (Tab G). 

Applicants compared the 121-residue NMB sequence (NP_066563) to the 560-residue 
GPNMB sequence of NP_002501 (Tab H) using the Blast 2 Sequences program of the NCBI.^3 
No significant similarity was found. (See Tab I.) Thus, the PTO's contention that NMB and 
GPNMB are the same is simply not true. 

Applicants also searched the entire GenBank CDS translations using BlastP with the 
121-residue neuromedin B sequence as the query. Of the 18 hits, none was GPNMB. See Tab J. 

Reiter and Maynard do not remedy this deficiency. Both are cited as teaching 
"conjugation of antibodies in general to either toxic agents or labels for the treatment or imaging 
of cancer." Thus, even if one were motivated to combine the teachings of the cited references, 
one would not achieve the present invention. None of the cited references teaches antibodies to 
GPNMB. 

Withdrawal of this rejection is respectfully requested. 



Respectfully submitted, 
BANNER & WITCOFF, LTD. 



Dated: July 14, 2003 




Registration No. 32,141 

Customer No. 22907 



3 Note that the protein sequences are derived from the gene sequences M21551.1 and X76534.1. Each of these gene 
sequences is cited in the specification at Table 2. 

LOCUS       2669 bp    mRNA    linear   PRI 09-FEB-1995
DEFINITION  H.sapiens NMB mRNA
KEYWORDS    NMB gene.
SOURCE      Homo sapiens (human)
  ORGANISM  Craniata; Vertebrata; Euteleostomi;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2669)
  AUTHORS   Weterman,M.A., Ajubi,N., van Dinter,I.M., Degen,W.G., van
            Muijen,G.N., Ruitter,D.J. and Bloemers,H.P.
  TITLE     nmb, a novel gene, is expressed in low-metastatic human melanoma
            cell lines and xenografts
  JOURNAL   Int. J. Cancer 60 (1), 73-81 (1995)
  MEDLINE   95113576
   PUBMED   7814155
REFERENCE   2
  AUTHORS   Weterman,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-DEC-1993) M. Weterman, University of Nijmegen, Dept
            of Biochemistry, PO Box 9101, 6500 HB Nijmegen, NETHERLANDS
FEATURES             Location/Qualifiers
     source          1..2669
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="NMB"
                     /cell_line="MV1"
                     /clone_lib="MV1"
     gene            1..2669
                     /gene="NMB"
     CDS             92..1774
                     /gene="NMB"
                     /codon_start=1
                     /protein_id="CAA54044.1"
                     /db_xref="GI:666043"
                     /db_xref="SWISS-PROT:Q14956"
                     /translation="MECLYYFLGFLLLAARLPLDAAKRFHDVLGNERPSAYMREHNQL
                     NGWSSDENDWNEKLYPVWKRGDMRWKNSWKGGRVQAVLTSDSPALVGSNITFAVNLIF
                     PRCQKEDANGNIVYEKNCRNEAGLSADPYVYNWTAWSEDSDGENGTGQSHHNVFPDGK
                     PFPHHPGWRRWNFIYVFHTLGQYFQKLGRCSVRVSVNTANVTLGPQLMEVTVYRRHGR
                     AYVPIAQVKDVYVVTDQIPVFVTMFQKNDRNSSDETFLKDLPIMFDVLIHDPSHFLNY
                     STINYKWSFGDNTGLFVSTNHTVNHTYVLNGTFSLNLTVKAAAPGPCPPPPPPPRPSK
                     PTPSLGPAGDNPLELSRIPDENCQINRYGHFQATITIVEGILEVNIIQMTDVLMPVPW
                     PESSLIDFVVTCQGSIPTEVCTIISDPTCEITQNTVCSPVDVDEMCLLTVRRTFNGSG
                     TYCVNLTLGDDTSLALTSTLISVPDRDPASPLRMANSALISVGCLAIFVTVISLLVYK
                     KHKEYNPIENSPGNVVRSKGLSVFLNRAKAVFFPGNQEKDPLLKNQEFKGVS"
     mat_peptide     155..1771
                     /gene="NMB"
                     /product="unnamed"
BASE COUNT      752 a    595 c    597 g    725 t
ORIGIN

1 cagatgccag aagaacactg ttgctcttgg tggacgggcc cagaggaatt cagagttaaa 
61 ccttgagtgc ctgcgtccgt gagaattcag catggaatgt ctctactatt tcctgggatt 
121 tctgctcctg gctgcaagat tgccacttga tgccgccaaa cgatttcatg atgtgctggg 
181 caatgaaaga ccttctgctt acatgaggga gcacaatcaa ttaaatggct ggtcttctga 
241 tgaaaatgac tggaatgaaa aactctaccc agtgtggaag cggggagaca tgaggtggaa 
301 aaactcctgg aagggaggcc gtgtgcaggc ggtcctgacc agtgactcac cagccctcgt 
361 gggctcaaat ataacatttg cggtgaacct gatattccct agatgccaaa aggaagatgc 
421 caatggcaac atagtctatg agaagaactg cagaaatgag gctggtttat ctgctgatcc 
481 atatgtttac aactggacag catggtcaga ggacagtgac ggggaaaatg gcaccggcca 
541 aagccatcat aacgtcttcc ctgatgggaa accttttcct caccaccccg gatggagaag 
601 atggaatttc atctacgtct tccacacact tggtcagtat ttccagaaat tgggacgatg 
661 ttcagtgaga gtttctgtga acacagccaa tgtgacactt gggcctcaac tcatggaagt 
721 gactgtctac agaagacatg gacgggcata tgttcccatc gcacaagtga aagatgtgta 
781 cgtggtaaca gatcagattc ctgtgtttgt gactatgttc cagaagaacg atcgaaattc 
841 atccgacgaa accttcctca aagatctccc cattatgttt gatgtcctga ttcatgatcc 
901 tagccacttc ctcaattatt ctaccattaa ctacaagtgg agcttcgggg ataatactgg 
961 cctgtttgtt tccaccaatc atactgtgaa tcacacgtat gtgctcaatg gaaccttcag 
1021 ccttaacctc actgtgaaag ctgcagcacc aggaccttgt ccgccaccgc caccaccacc 
1081 cagaccttca aaacccaccc cttctttagg acctgctggt gacaaccccc tggagctgag 
1141 taggattcct gatgaaaact gccagattaa cagatatggc cactttcaag ccaccatcac 
1201 aattgtagag ggaatcttag aggttaacat catccagatg acagacgtcc tgatgccggt 
1261 gccatggcct gaaagctccc taatagactt tgtcgtgacc tgccaaggga gcattcccac 
1321 ggaggtctgt accatcattt ctgaccccac ctgcgagatc acccagaaca cagtctgcag 
1381 ccctgtggat gtggatgaga tgtgtctgct gactgtgaga cgaaccttca atgggtctgg 
1441 gacgtactgt gtgaacctca ccctggggga tgacacaagc ctggctctca cgagcaccct 
1501 gatttctgtt cctgacagag acccagcctc gcctttaagg atggcaaaca gtgccctgat 
1561 ctccgttggc tgcttggcca tatttgtcac tgtgatctcc ctcttggtgt acaaaaaaca 
1621 caaggaatac aacccaatag aaaatagtcc tgggaatgtg gtcagaagca aaggcctgag 
1681 tgtctttctc aaccgtgcaa aagccgtgtt cttcccggga aaccaggaaa aggatccgct 
1741 actcaaaaac caagaattta aaggagtttc ttaaatttcg accttgtttc tgaagctcac 
1801 ttttcagtgc cattgatgtg agatgtgctg gagtggctat taaccttttt ttcctaaaga 
1861 ttattgttaa atagatattg tggtttgggg aagttgaatt ttttataggt taaatgtcat 
1921 tttagagatg gggagaggga ttatactgca ggcagcttca gccatgttgt gaaactgata 
1981 aaagcaactt agcaaggctt cttttcatta ttttttatgt ttcacttata aagtcttagg 
2041 taactagtag gatagaaaca ctgtgtcccg agagtaagga gagaagctac tattgattag 
2101 agcctaaccc aggttaactg caagaagagg cgggatactt tcagctttcc atgtaactgt 
2161 atgcataaag ccaatgtagt ccagtttcta agatcatgtt ccaagctaac tgaatcccac 
2221 ttcaatacac actcatgaac tcctgatgga acaataacag gcccaagcct gtggtatgat 
2281 gtgcacactt gctagactca gaaaaaatac tactctcata aatgggtggg agtattttgg 
2341 tgacaaccta ctttgcttgg ctgagtgaag gaatgatatt catatattca tttattccat 
2401 ggacatttag ttagtgcttt ttatatacca ggcatgatgc tgagtgacac tcttgtgtat 
2461 atttccaaat ttttgtatag tcgctgcaca tatttgaaat catatattaa gactttccaa 
2521 agatgaggtc cctggttttt catggcaact tgatcagtaa ggatttcacc tctgtttgta 
2581 actaaaacca tctactatat gttagacatg acattctttt tctctccttc ctgaaaaata 
2641 aagtgtggga agagacaaaa aaaaaaaaa 

Sarah, 

This program predicts the extracellular and intracellular regions of the protein. 

The figure is in color; listed below is the probability table used to draw the figure above. 

As seen, amino acids 1-483 have a probability of >0.98 of being extracellular, 
and amino acids 509-560 have a probability of >0.98 of being intracellular. 
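
As a rough illustration of how a region such as 1-483 can be read off a per-residue probability table, the Python sketch below groups consecutive residues whose extracellular probability exceeds a threshold. It rests on assumptions that the memo does not state: each row is taken to hold a position, a residue letter, and three probabilities, with the last of the three treated as the extracellular probability. The function name and the sample rows (a non-contiguous excerpt of residues 38-40 and 43-44 from the table) are chosen for illustration only.

```python
from typing import Iterable, List, Tuple

# One table row: position, residue letter, then three probabilities.
# Treating the last probability as "extracellular" is an assumption.
Row = Tuple[int, str, float, float, float]

# Non-contiguous excerpt of the probability table (residues 38-40, 43-44).
SAMPLE_ROWS: List[Row] = [
    (38, "M", 0.00106, 0.00000, 0.99895),
    (39, "R", 0.00106, 0.00000, 0.99895),
    (40, "E", 0.00106, 0.00000, 0.99895),
    (43, "Q", 0.00106, 0.00000, 0.99895),
    (44, "L", 0.00106, 0.00000, 0.99895),
]


def extracellular_runs(rows: Iterable[Row], threshold: float = 0.98) -> List[Tuple[int, int]]:
    """Return (start, end) positions of consecutive residues above `threshold`."""
    runs: List[Tuple[int, int]] = []
    start = prev = None
    for pos, _residue, _p_first, _p_second, p_extracellular in rows:
        if p_extracellular > threshold:
            if start is None:
                start = pos                  # open a new run
            elif pos != prev + 1:
                runs.append((start, prev))   # gap in positions: close the run
                start = pos
            prev = pos
        elif start is not None:
            runs.append((start, prev))       # probability dropped: close the run
            start = prev = None
    if start is not None:
        runs.append((start, prev))           # close a run left open at the end
    return runs


print(extracellular_runs(SAMPLE_ROWS))
```

On this excerpt the function reports two runs, 38-40 and 43-44, because residues 41 and 42 were left out of the sample. Applied to a complete table, a single run spanning the extracellular domain would be expected if the probabilities are as the memo describes.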

38 M 0.00106 0.00000 0.99895
39 R 0.00106 0.00000 0.99895
40 E 0.00106 0.00000 0.99895
41 H 0.00106 0.00000 0.99895
42 N 0.00106 0.00000 0.99895
43 Q 0.00106 0.00000 0.99895
44 L 0.00106 0.00000 0.99895
45 N 0.00106 0.00000 0.99895
46 G 0.00106 0.00000 0.99895
47 W 0.00106 0.00000 0.99895
48 S 0.00106 0.00000 0.99895
49 S 0.00106 0.00000 0.99895
50 D 0.00106 0.00000 0.99895
51 E 0.00106 0.00000 0.99895
52 N 0.00106 0.00000 0.99895
53 D 0.00106 0.00000 0.99895
54 W 0.00106 0.00000 0.99895
55 N 0.00106 0.00000 0.99895
56 E 0.00106 0.00000 0.99895
57 K 0.00106 0.00000 0.99895
58 L 0.00106 0.00000 0.99895
59 Y 0.00106 0.00000 0.99895
60 P 0.00106 0.00000 0.99895
61 V 0.00106 0.00000 0.99895
62 W 0.00106 0.00000 0.99895
63 K 0.00106 0.00000 0.99895
64 R 0.00106 0.00000 0.99895
65 G 0.00106 0.00000 0.99895
66 D 0.00106 0.00000 0.99895
67 M 0.00106 0.00000 0.99895
68 R 0.00106 0.00000 0.99895
69 W 0.00106 0.00000 0.99895
70 K 0.00106 0.00000 0.99895
71 N 0.00106 0.00000 0.99895
72 S 0.00106 0.00000 0.99895
73 W 0.00106 0.00000 0.99895
74 K 0.00105 0.00000 0.99895
75 G 0.00104 0.00002 0.99894
76 G 0.00103 0.00003 0.99894
77 R 0.00102 0.00004 0.99893
78 V 0.00099 0.00008 0.99893
79 Q 0.00099 0.00010 0.99892
80 A 0.00090 0.00026 0.99884
81 V 0.00088 0.00032 0.99880
82 L 0.00088 0.00032 0.99879
83 T 0.00088 0.00033 0.99879
84 S 0.00088 0.00034 0.99879
85 D 0.00088 0.00034 0.99878
86 S 0.00088 0.00036 0.99877
87 P 0.00087 0.00036 0.99876
88 A 0.00087 0.00037 0.99876
89 L 0.00087 0.00037 0.99876
90 V 0.00087 0.00037 0.99876
91 G 0.00087 0.00037 0.99876
92 S 0.00087 0.00037 0.99876
93 N 0.00087 0.00037 0.99876
94 I 0.00087 0.00037 0.99876
95 T 0.00087 0.00037 0.99876
96 F 0.00087 0.00036 0.99876
97 A 0.00087 0.00036 0.99877
98 V 0.00088 0.00034 0.99878
99 N 0.00088 0.00031 0.99881
100 L 0.00088 0.00031 0.99881
101 I 0.00089 0.00028 0.99883
102 F 0.00091 0.00025 0.99884
103 P 0.00100 0.00008 0.99892
104 R 0.00105 0.00001 0.99894
105 C 0.00105 0.00001 0.99894
106 Q 0.00105 0.00000 0.99894
107 K 0.00106 0.00000 0.99894
108 E 0.00106 0.00000 0.99894
109 D 0.00106 0.00000 0.99894
110 A 0.00106 0.00000 0.99894
111 N 0.00106 0.00000 0.99894
112 G 0.00106 0.00000 0.99894
113 N 0.00106 0.00000 0.99894






114 I 0.00106 0.00000 0.99894
115 V 0.00106 0.00000 0.99894
116 Y 0.00106 0.00000 0.99894
117 E 0.00106 0.00000 0.99894
118 K 0.00106 0.00000 0.99894
119 N 0.00106 0.00000 0.99894
120 C 0.00106 0.00000 0.99894
121 R 0.00106 0.00000 0.99894
122 N 0.00106 0.00000 0.99894
123 E 0.00106 0.00000 0.99894
124 A 0.00106 0.00000 0.99894
125 G 0.00106 0.00000 0.99894
126 L 0.00106 0.00000 0.99894
127 S 0.00106 0.00000 0.99894
128 A 0.00106 0.00000 0.99894
129 D 0.00106 0.00000 0.99894
130 P 0.00106 0.00000 0.99894
131 Y 0.00106 0.00000 0.99894
132 V 0.00106 0.00000 0.99894
133 Y 0.00106 0.00000 0.99894
134 N 0.00106 0.00000 0.99894
135 W 0.00106 0.00000 0.99894
136 T 0.00106 0.00000 0.99894
137 A 0.00106 0.00000 0.99894
138 W 0.00106 0.00000 0.99894
139 S 0.00106 0.00000 0.99894
140 E 0.00106 0.00000 0.99894
141 D 0.00106 0.00000 0.99894
142 S 0.00106 0.00000 0.99894
143 D 0.00106 0.00000 0.99894
144 G 0.00106 0.00000 0.99894
145 E 0.00106 0.00000 0.99894
146 N 0.00106 0.00000 0.99894
147 G 0.00106 0.00000 0.99894
148 T 0.00106 0.00000 0.99894
149 G 0.00106 0.00000 0.99894
150 Q 0.00106 0.00000 0.99894
151 S 0.00106 0.00000 0.99894
152 H 0.00106 0.00000 0.99894
153 H 0.00106 0.00000 0.99894
154 N 0.00106 0.00000 0.99894
155 V 0.00106 0.00000 0.99894
156 F 0.00106 0.00000 0.99894
157 P 0.00106 0.00000 0.99894
158 D 0.00106 0.00000 0.99894
159 G 0.00106 0.00000 0.99894
160 K 0.00106 0.00000 0.99894
161 P 0.00106 0.00000 0.99894
162 F 0.00106 0.00000 0.99894
163 P 0.00106 0.00000 0.99894
164 H 0.00106 0.00000 0.99894
165 H 0.00106 0.00000 0.99894
166 P 0.00105 0.00000 0.99894
167 G 0.00105 0.00001 0.99894
168 W 0.00105 0.00001 0.99894
169 R 0.00105 0.00001 0.99894
170 R 0.00105 0.00001 0.99893
171 W 0.00105 0.00002 0.99893
172 N 0.00105 0.00002 0.99893
173 F 0.00105 0.00002 0.99893
174 I 0.00105 0.00002 0.99893
175 Y 0.00105 0.00002 0.99893
176 V 0.00105 0.00002 0.99893
177 F 0.00105 0.00002 0.99893
178 H 0.00105 0.00002 0.99893
179 T 0.00105 0.00002 0.99893
180 L 0.00105 0.00002 0.99893
181 G 0.00105 0.00002 0.99893
182 Q 0.00105 0.00002 0.99893
183 Y 0.00105 0.00002 0.99893
184 F 0.00105 0.00002 0.99893
185 Q 0.00106 0.00001 0.99893
186 K 0.00106 0.00001 0.99893
187 L 0.00106 0.00001 0.99893
188 G 0.00106 0.00000 0.99893
189 R 0.00106 0.00000 0.99893
190 C 0.00106 0.00000 0.99893
191 S 0.00106 0.00001 0.99893
192 V 0.00106 0.00001 0.99893
193 R 0.00106 0.00001 0.99893
194 V 0.00106 0.00001 0.99893
195 S 0.00106 0.00001 0.99893
196 V 0.00106 0.00001 0.99893
197 N 0.00106 0.00001 0.99893
198 T 0.00106 0.00001 0.99893
199 A 0.00106 0.00001 0.99893
200 N 0.00106 0.00001 0.99893
201 V 0.00106 0.00001 0.99893
202 T 0.00106 0.00001 0.99893
203 L 0.00106 0.00001 0.99893
204 G 0.00106 0.00001 0.99893
205 P 0.00106 0.00001 0.99893
206 Q 0.00106 0.00001 0.99893
207 L 0.00106 0.00001 0.99893
208 M 0.00106 0.00001 0.99893
209 E 0.00106 0.00001 0.99893
210 V 0.00106 0.00001 0.99893
211 T 0.00106 0.00001 0.99893
212 V 0.00106 0.00001 0.99893
213 Y 0.00106 0.00001 0.99893
214 R 0.00106 0.00000 0.99894
215 R 0.00106 0.00000 0.99894
216 H 0.00106 0.00000 0.99894
217 G 0.00106 0.00000 0.99894
218 R 0.00106 0.00000 0.99894
219 A 0.00106 0.00000 0.99894
220 Y 0.00106 0.00000 0.99894
221 V 0.00106 0.00001 0.99893
222 P 0.00106 0.00001 0.99893
223 I 0.00106 0.00001 0.99893
224 A 0.00106 0.00001 0.99893
225 Q 0.00106 0.00001 0.99893
226 V 0.00106 0.00001 0.99893
227 K 0.00106 0.00001 0.99893
228 D 0.00106 0.00001 0.99893
229 V 0.00106 0.00001 0.99893
230 Y 0.00106 0.00001 0.99893
231 V 0.00106 0.00001 0.99893
232 V 0.00106 0.00001 0.99893
233 T 0.00106 0.00001 0.99893
234 D 0.00106 0.00001 0.99893
235 Q 0.00106 0.00001 0.99893
236 I 0.00106 0.00001 0.99893
237 P 0.00106 0.00001 0.99893
238 V 0.00106 0.00001 0.99893
239 F 0.00106 0.00001 0.99893
240 V 0.00106 0.00001 0.99893
241 T 0.00106 0.00001 0.99893
242 M 0.00106 0.00000 0.99893
243 F 0.00106 0.00000 0.99893
244 Q 0.00106 0.00000 0.99894
245 K 0.00106 0.00000 0.99894
246 N 0.00106 0.00000 0.99894
247 D 0.00106 0.00000 0.99894
248 R 0.00106 0.00000 0.99894
249 N 0.00106 0.00000 0.99894
250 S 0.00106 0.00000 0.99894
251 S 0.00106 0.00000 0.99894
252 D 0.00106 0.00000 0.99894
253 E 0.00106 0.00000 0.99894
254 T 0.00106 0.00000 0.99894
255 F 0.00106 0.00000 0.99894
256 L 0.00106 0.00000 0.99894
257 K 0.00106 0.00000 0.99894
258 D 0.00106 0.00000 0.99894
259 L 0.00106 0.00000 0.99893
260 P 0.00106 0.00000 0.99893
261 I 0.00106 0.00000 0.99893
262 M 0.00106 0.00000 0.99893
263 F 0.00106 0.00000 0.99893
264 D 0.00106 0.00000 0.99893
265 V 0.00106 0.00000 0.99893
266 L 0.00106 0.00000 0.99893
267 I 0.00106 0.00000 0.99893
268 H 0.00106 0.00000 0.99893
269 D 0.00106 0.00000 0.99893
270 P 0.00106 0.00000 0.99893
271 S 0.00106 0.00000 0.99893
272 H 0.00106 0.00000 0.99893
273 F 0.00106 0.00000 0.99893
274 L 0.00106 0.00000 0.99893
275 N 0.00106 0.00000 0.99893
276 Y 0.00106 0.00000 0.99893
277 S 0.00106 0.00000 0.99893
278 T 0.00106 0.00000 0.99893
279 I 0.00106 0.00000 0.99893
280 N 0.00107 0.00000 0.99893
281 Y 0.00107 0.00000 0.99893
282 K 0.00107 0.00000 0.99893
283 W 0.00107 0.00000 0.99893
284 S 0.00107 0.00000 0.99893
285 F 0.00107 0.00001 0.99893
286 G 0.00107 0.00000 0.99893
287 D 0.00107 0.00000 0.99893
288 N 0.00107 0.00001 0.99893
289 T 0.00107 0.00001 0.99893
290 G 0.00107 0.00001 0.99892
291 L 0.00107 0.00001 0.99892
292 F 0.00107 0.00001 0.99892
293 V 0.00107 0.00001 0.99892
294 S 0.00107 0.00001 0.99892
295 T 0.00107 0.00001 0.99892
296 N 0.00107 0.00001 0.99892
297 H 0.00107 0.00001 0.99892
298 T 0.00107 0.00001 0.99892
299 V 0.00107 0.00001 0.99892
300 N 0.00107 0.00001 0.99892
301 H 0.00107 0.00001 0.99892
302 T 0.00107 0.00001 0.99892
303 Y 0.00107 0.00001 0.99892
304 V 0.00106 0.00001 0.99892
305 L 0.00107 0.00001 0.99892
306 N 0.00107 0.00001 0.99892
307 G 0.00107 0.00001 0.99892
308 T 0.00107 0.00001 0.99892
309 F 0.00107 0.00001 0.99892
310 S 0.00107 0.00001 0.99892
311 L 0.00107 0.00001 0.99892
312 N 0.00107 0.00001 0.99892
313 L 0.00107 0.00001 0.99892
314 T 0.00107 0.00000 0.99892
315 V 0.00107 0.00000 0.99892
316 K 0.00108 0.00000 0.99892
317 A 0.00108 0.00000 0.99892
318 A 0.00108 0.00000 0.99892
319 A 0.00108 0.00000 0.99892
320 P 0.00108 0.00000 0.99892
321 G 0.00108 0.00000 0.99892
322 P 0.00108 0.00000 0.99892
323 C 0.00108 0.00000 0.99892
324 P 0.00108 0.00000 0.99892
325 P 0.00108 0.00000 0.99892
326 P 0.00108 0.00000 0.99892
327 P 0.00108 0.00000 0.99892
328 P 0.00108 0.00000 0.99892
329 P 0.00108 0.00000 0.99892
330 P 0.00108 0.00000 0.99892
331 R 0.00108 0.00000 0.99892
332 P 0.00108 0.00000 0.99892
333 S 0.00108 0.00000 0.99892
334 K 0.00108 0.00000 0.99892
335 P 0.00108 0.00000 0.99892
336 T 0.00108 0.00000 0.99892
337 P 0.00108 0.00000 0.99892
338 S 0.00108 0.00000 0.99892
339 L 0.00108 0.00000 0.99892
340 G 0.00108 0.00000 0.99892
341 P 0.00108 0.00000 0.99892
342 A 0.00108 0.00000 0.99892
343 G 0.00108 0.00000 0.99892
344 D 0.00108 0.00000 0.99892
345 N 0.00108 0.00000 0.99892
346 P 0.00108 0.00000 0.99892
347 L 0.00108 0.00000 0.99892
348 E 0.00108 0.00000 0.99892
349 L 0.00108 0.00000 0.99892
350 S 0.00108 0.00000 0.99892
351 R 0.00108 0.00000 0.99892
352 I 0.00108 0.00000 0.99892
353 P 0.00108 0.00000 0.99892
354 D 0.00108 0.00000 0.99892
355 E 0.00108 0.00000 0.99892
356 N 0.00108 0.00000 0.99892
357 C 0.00108 0.00000 0.99892
358 Q 0.00108 0.00000 0.99892
359 I 0.00107 0.00001 0.99891
360 N 0.00107 0.00002 0.99891
361 R 0.00106 0.00003 0.99891
362 Y 0.00104 0.00013 0.99883
363 G 0.00104 0.00015 0.99881
364 H 0.00103 0.00017 0.99879
365 F 0.00103 0.00018 0.99879
366 Q 0.00102 0.00018 0.99879
367 A 0.00102 0.00019 0.99879
368 T 0.00102 0.00020 0.99879
369 I 0.00101 0.00020 0.99878
370 T 0.00101 0.00021 0.99878
371 I 0.00100 0.00021 0.99878
372 V 0.00100 0.00021 0.99878
373 E 0.00100 0.00021 0.99878
374 G 0.00100 0.00021 0.99878
375 I 0.00100 0.00021 0.99878
376 L 0.00100 0.00021 0.99878
377 E 0.00100 0.00021 0.99878
378 V 0.00100 0.00021 0.99878
379 N 0.00101 0.00021 0.99878
380 I 0.00101 0.00020 0.99878
381 I 0.00104 0.00018 0.99878
382 Q 0.00113 0.00007 0.99881
383 M 0.00113 0.00006 0.99881
384 T 0.00113 0.00005 0.99882
385 D 0.00114 0.00003 0.99884
386 V 0.00114 0.00003 0.99884
387 L 0.00114 0.00003 0.99884
388 M 0.00114 0.00002 0.99884
389 P 0.00114 0.00002 0.99885
390 V 0.00114 0.00002 0.99885
391 P 0.00114 0.00001 0.99885
392 W 0.00114 0.00001 0.99886
393 P 0.00114 0.00000 0.99886
394 E 0.00114 0.00000 0.99886
395 S 0.00114 0.00000 0.99885
396 S 0.00114 0.00001 0.99885
397 L 0.00114 0.00001 0.99885
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398 I 0.00114 

399 D 0.00114 

400 F 0.00114 

401 V 0.00114 

402 V 0.00114 

403 T 0.00114 

404 C 0.00114 

405 Q 0.00114 

406 G 0.00114 

407 S 0.00114 

408 I 0.00114 

409 P 0.00114 

410 T 0.00114 

411 E 0.00114 

412 V 0.00114 

413 C 0.00114 

414 T 0.00114 

415 I 0.00114 

416 I 0.00114 

417 S 0.00115 

418 D 0.00115 

419 P 0.00115 

420 T 0.00115 

421 C 0.00115 

422 E 0.00115 

423 I 0.00115 

424 T 0.00115 

425 Q 0.00115 

426 N 0.00115 

427 T 0.00115 

428 V 0.00115 

429 C 0.00115 

430 S 0.00115 

431 P 0.00115 

432 V 0.00115 

433 D 0.00115 

434 V 0.00115 

435 D 0.00115 

436 E 0.00115 

437 M 0.00115 

438 C 0.00115 

439 L 0.00115 

440 L 0.00115 

441 T 0.00115 

442 V 0.00115 

443 R 0.00115 

444 R 0.00115 

445 T 0.00114 

446 F 0.00113 

447 N 0.00113 

448 G 0.00112 

449 S 0.00111 

450 G 0.00109 

451 T 0.00092 

452 Y 0.00074 

453 C 0.00072 

454 V 0 . 00070 

455 N 0.00070 

456 L 0.00070 

457 T 0.00070 

458 L 0.00070 

459 G 0.00070 

460 D 0.00070 

461 D 0.00070 

462 T 0.00070 

463 S 0.00070 

464 L 0.00070 

465 A 0.00070 

466 L 0.00070 

467 T 0.00071 

468 S 0.00071 



0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00001 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00001 
0. 00002 
0.00003 
0.00005 
0. 00006 
0.00011 
0.00030 
0.00050 
0.00053 
0.00055 
0.00055 
0.00056 
0.00056 
0.00056 
0.00056 
0.00056 
0. 00056 
0.00056 
0.00056 
0. 00056 
0.00055 
0.00055 
0. 00055 
0.00054 



0.99885 
0.99885 
0.99885 
0.99885 
0. 99885 
0.99885 
0.99885 
0. 99885 
0.99885 
0.99885 
0. 99885 
0 . 99885 
0.99885 
0 . 99885 
0.99885 
0.99885 
0 . 99885 
0.99885 
0.99885 
0.99885 
0.99885 
0.99885 
0 . 99885 
0. 99885 
0 . 99885 
0.99885 
0.99885 
0 . 99885 
0.99885 
0.99885 
0 . 99885 
0 . 99885 
0 . 99885 
0 . 99885 
0.99885 
0. 99885 
0.99885 
0.99885 
0 . 99885 
0 . 99885 
0 . 99885 
0.99885 
0 . 99885 
0.99885 
0.99885 
0 . 99885 
0 . 99885 
0.99884 
0 . 99884 
0.99884 
0 . 99884 
0 . 99883 
0 . 99881 
0.99877 
0 . 99876 
0 . 99876 
0.99875 
0.99875 
0 . 99874 
0.99874 
0.99874 
0 . 99874 
0.99874 
0. 99874 
0 . 99874 
0.99874 
0.99874 
0 . 99875 
0 . 99875 
0. 99875 
0. 99875 
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469 T 0.00071 

470 L 0.00071 

471 I 0.00071 

472 S 0.00072 
'." 473 V 0.00075 

474 P 0.00080 

475 D 0.00081 

476 R 0.00081 
! 477 D 0.00081 
■ 478 P 0.00081 

479 A 0.00081 
1 480 S 0.00081 
L. 481 P 0.00081 

482 L 0.00081 

483 R 0.00080 

484 M 0.00072 

485 A 0.00030 
i 486 N 0.00025 

487 S 0.00018 
f- 488 A 0.00007 

489 L 0.00001 
; 4 90 I 0.00000 

491 S 0.00000 

492 V 0.00000 

493 G 0.00000 
! : 494 C 0.00000 

495 L 0.00000 

496 A 0.00000 
i . 497 I 0. 00000 
; 498 F 0.00000 

499 V 0.00000 

500 T 0.00003 
i . 501 V 0 .00004 
! 502 I 0.00048 

503 S 0.00161 
, 504 L 0.00325 
i ; 505 L 0.01311 
( 506 V 0.05549 

507 Y 0.41061 

508 K 0.96651 
! 509 K 0.98966 
( ., 510 H 0.99112 

511 K 0.99165 

512 E 0.99166 
! 513 Y 0.99166 

514 N 0.99166 

515 P 0.99166 

516 I 0.99166 

517 E 0.99166 
; ■ 518 N 0.99166 

519 S 0.99165 

520 P 0.99165 

521 G 0.99164 
if 522 N 0.99164 

523 V 0.99163 

524 V 0.99163 

525 R 0.99163 

526 S 0.99162 

527 K 0.99161 

528 G 0.99155 
{ . 529 L 0.99155 
I . 530 S 0.99155 
' ■ 531 V 0.99155 

532 F 0.99155 
I 533 L 0.99155 

534 N 0.99155 

535 R 0.99155 

536 A 0.99155 
! 537 K 0.99155 
<■ 538 A 0.99155 

539 V 0.99155 



0.00053 
0.00052 
0.00050 
0.00047 
0.00041 
0.00017 
0.00001 
0.00000 
0.00000 
0.00001 
0.00024 
0.00065 
0.00126 
0.00527 
0.01342 
0.23647 
0.54536 
0.58448 
0.68094 
0.91632 
0.98533 
0.99200 
0.99227 
0.99245 
0.99247 
0.99247 
0.99247 
0.99247 
0.99247 
0.99247 
0.99247 
0 . 99244 
0.99243 
0.99199 
0.99085 
0.98921 
0.97933 
0.93691 
0.58161 
0.02517 
0 .00200 
0.00054 
0.00001 
0.00000 
0.00000 
0.00000 
0 . 00000 
0 .00000 
0.00000 
0.00000 
0.00001 
0.00001 
0.00002 
0.00002 
0.00003 
0.00003 
0.00003 
0.00004 
0.00005 
0.00011 
0.00011 
0.00011 
0.00011 
0.00011 
0.00011 
0.00011 
0.00011 
0.00011 
0.00011 
0.00011 
0 . 00011 



0.99876 

0.99877 

0.99879 

0.99882 

0.99884 

0. 99903 

0.99919 

0.99919 

0.99919 

0.99918 

0.99895 

0.99853 

0.99793 

0.99391 

0.98577 

0.76281 

0.45434 

0.41528 

0.31887 

0. 08361 

0.01466 

0. 008 

0.00772 

0.00755 

0.00753 

0. 00753 

0.00753 

0.00753 

0.00753 

0.00753 

0.00753 

0.00753 

0.00753 

0.00753 

0.00753 

0.00754 

0.00756 

0. 0076 

0.00778 

0.00832 

0.00834 

0.00834 

0.00834 

0. 00834 

0.00834 

0.00834 

0.00834 

0.00834 

0.00834 

0.00834 

0.00834 

0. 00834 

0.00834 

0.00834 

0. 00834 

0.00834 

0.00834 

0.00834 

0. 00834 

0.00834 

0.00834 

0.00834 

0.00834 

0.00834 

0.00834 

0.00834 

0.00834 

0. 00834 

0.00834 

0.00834 

0.00834 
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540 F 0.99155 

541 F 0.99155 

542 P 0.99155 

543 G 0.99155 

544 N 0.99155 

545 Q 0.99155 

546 E 0.99155 

547 K 0.99155 

548 D 0.99155 

549 P 0.99155 

550 L 0.99155 

551 L 0.99155 

552 K 0.99155 

553 N 0.99155 

554 Q 0.99155 

555 E 0.99155 

556 F 0.99155 

557 K 0.99155 

558 G 0.99155 

559 V 0.99155 

560 S 0.99155 



0.00011 
0.00011 
0.00010 
0.00008 
0.00005 
0.00004 
0.00001 
0.00000 
0.00000 
0.00000 
0.00000 

o.ooboo 

0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0. 00000 
0.00000 
0.00000 
0.00000 



0.00834 

0.00835 

0.00835 

0 . 00838 

0.0084 

0.00841 

0 . 00844 

0.00845 

0.00845 

0.00845 

0.00845 

0 . 00845 

0.00845 

0.00845 

0.00845 

0.00845 

0.00845 

0.00845 

0.00845 

0.00845 

0.00845 
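
The per-residue values above can be screened programmatically. The following is a minimal Python sketch, assuming each row carries a residue number, a one-letter amino acid, and three probabilities that sum to about 1. This excerpt does not label the three columns, so the column index is passed in explicitly rather than given a biological name. The function name high_probability_runs and the 0.5 threshold are illustrative choices and are not part of the original analysis.

```python
from typing import List, Tuple

# (residue number, amino acid, first, second, third probability)
Row = Tuple[int, str, float, float, float]


def high_probability_runs(rows: List[Row], column: int,
                          threshold: float = 0.5) -> List[Tuple[int, int]]:
    """Return (first, last) residue numbers of consecutive rows whose
    value in the chosen column (tuple index 2, 3 or 4) exceeds threshold."""
    runs: List[Tuple[int, int]] = []
    start = None  # first residue of the run currently open, if any
    prev = None   # last residue added to the open run
    for row in rows:
        pos, value = row[0], row[column]
        if value > threshold:
            if start is None:
                start = pos
            elif pos != prev + 1:
                # Numbering gap: close the open run and begin a new one.
                runs.append((start, prev))
                start = pos
            prev = pos
        elif start is not None:
            runs.append((start, prev))
            start = None
    if start is not None:
        runs.append((start, prev))
    return runs


# Rows 483-489, copied from the listing above.
SAMPLE: List[Row] = [
    (483, "R", 0.00080, 0.01342, 0.98577),
    (484, "M", 0.00072, 0.23647, 0.76281),
    (485, "A", 0.00030, 0.54536, 0.45434),
    (486, "N", 0.00025, 0.58448, 0.41528),
    (487, "S", 0.00018, 0.68094, 0.31887),
    (488, "A", 0.00007, 0.91632, 0.08361),
    (489, "L", 0.00001, 0.98533, 0.01466),
]

print(high_probability_runs(SAMPLE, column=3))  # [(485, 489)]
print(high_probability_runs(SAMPLE, column=4))  # [(483, 484)]
```

In these seven sample rows the second probability first exceeds 0.5 at residue 485, where the third probability drops below it.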

LOCUS P02786 760 aa linear PRI 15-SEP-2003 

DEFINITION Transferrin receptor protein 1 (TfR1) (TR) (TfR) (Trfr) (CD71
antigen) (T9) (p90).
ACCESSION P02786 
VERSION P02786 GI: 136378 

DBSOURCE swissprot: locus TFR1_HUMAN, accession P02786;
class: standard.
extra accessions: Q9UCN0, Q9UCU5, Q9UDF9, Q9UK21, created: Jul 21, 1986.
sequence updated: Jul 21, 1986.
annotation updated: Sep 15, 2003.
xrefs: gi: 37432, gi: 37433, gi: 339515, gi: 339516, gi: 6164847,
gi: 6164848, gi: 12654696, gi: 12654697, gi: 72139, pdb accession
1CX8, pdb accession 1DE4
xrefs (non-sequence databases): MEROPSM28.972, GlycoSuiteDBP02786,
GenewHGNC:11763, MIM 190010, GOGO:0005768, GOGO:0005576,
GOGO:0005887, GOGO:0004998, GOGO:0006879, InterProIPR003137,
PfamPF02225, PfamPF04389, PfamPF04253
KEYWORDS Transmembrane; Glycoprotein; Receptor; Lipoprotein; Palmitate;
Signal-anchor; Endocytosis; Phosphorylation; Polymorphism;
3D-structure.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (residues 1 to 760)
AUTHORS Schneider,C., Owen,M.J., Banville,D. and Williams,J.G.
TITLE Primary structure of human transferrin receptor deduced from the
mRNA sequence
JOURNAL Nature 311 (5987), 675-678 (1984)
MEDLINE 85012743
REMARK SEQUENCE FROM N.A.
REFERENCE 2 (residues 1 to 760)
AUTHORS McClelland,A., Kuhn,L.C. and Ruddle,F.H.
TITLE The human transferrin receptor gene: genomic organization, and the
complete primary structure of the receptor deduced from a cDNA
sequence
JOURNAL Cell 39 (2 Pt 1), 267-274 (1984)
MEDLINE 85048936
REMARK SEQUENCE FROM N.A.
REFERENCE 3 (residues 1 to 760)
AUTHORS Evans,P. and Kemp,J.
TITLE Exon/intron structure of the human transferrin receptor gene
JOURNAL Gene 199 (1-2), 123-131 (1997)
MEDLINE 98019079
REMARK SEQUENCE FROM N.A.
TISSUE=Placenta
REFERENCE 4 (residues 1 to 760)
AUTHORS Wheeler,D.L.
TITLE Molecular and evolutionary studies of the transferrin receptor
JOURNAL Thesis (1999)
REMARK SEQUENCE FROM N.A.
TISSUE=Placenta
REFERENCE 5 (residues 1 to 760)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S.N., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J.M. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
MEDLINE 22388257
REMARK SEQUENCE FROM N.A.
TISSUE=Eye
REFERENCE 6 (residues 1 to 760)
AUTHORS Shih,Y.J., Baynes,R.D., Hudson,B.G., Flowers,C.H., Skikne,B.S. and
Cook,J.D.
TITLE Serum transferrin receptor is a truncated form of tissue receptor
JOURNAL J. Biol. Chem. 265 (31), 19077-19081 (1990)
MEDLINE 91035436
REMARK SEQUENCE OF 101-119 (STFR).
REFERENCE 7 (residues 1 to 760)
AUTHORS Baynes,R.D., Shih,Y.J., Hudson,B.G. and Cook,J.D.
TITLE Characterization of transferrin receptor released by K562
erythroleukemia cells
JOURNAL Proc Soc Exp Biol Med 197 (4), 416-423 (1991)
MEDLINE 91334496
REMARK SEQUENCE OF 101-123 (STFR), AND CHARACTERIZATION.
TISSUE=Erythroleukemia
REFERENCE 8 (residues 1 to 760)
AUTHORS Coppolino,M., Migliorini,M., Argraves,W.S. and Dedhar,S.
TITLE Identification of a novel form of the alpha 3 integrin subunit:
covalent association with transferrin receptor
JOURNAL Biochem. J. 306 (Pt 1), 129-134 (1995)
MEDLINE 95169043
REMARK SEQUENCE OF 288-302; 694-708 AND 721-730.
TISSUE=Prostatic carcinoma
REFERENCE 9 (residues 1 to 760)
AUTHORS Chicz,R.M., Urban,R.G., Lane,W.S., Gorga,J.C., Stern,L.J.,
Vignali,D.A. and Strominger,J.L.
TITLE Predominant naturally processed peptides bound to HLA-DR1 are
derived from MHC-related molecules and are heterogeneous in size
JOURNAL Nature 358 (6389), 764-768 (1992)
MEDLINE 92375195
REMARK SEQUENCE OF 680-696.
REFERENCE 10 (residues 1 to 760)
AUTHORS Jing,S.Q. and Trowbridge,I.S.
TITLE Identification of the intermolecular disulfide bonds of the human
transferrin receptor and its lipid-attachment site
JOURNAL EMBO J. 6 (2), 327-331 (1987)
MEDLINE 87218484
REMARK PALMITOYLATION OF CYS-62.
REFERENCE 11 (residues 1 to 760)
AUTHORS Alvarez,E., Girones,N. and Davis,R.J.
TITLE Intermolecular disulfide bonds are not required for the expression
of the dimeric state and functional activity of the transferrin
receptor
JOURNAL EMBO J. 8 (8), 2231-2240 (1989)
MEDLINE 90005427
REMARK MUTAGENESIS OF CYSTEINES INVOLVED IN INTERMOLECULAR BONDS.
REFERENCE 12 (residues 1 to 760)
AUTHORS Jing,S.Q., Spencer,T., Miller,K., Hopkins,C. and Trowbridge,I.S.
TITLE Role of the human transferrin receptor cytoplasmic domain in
endocytosis: localization of a specific signal sequence for
internalization
JOURNAL J. Cell Biol. 110 (2), 283-294 (1990)
MEDLINE 90130619
REMARK INTERNALIZATION SEQUENCE, AND MUTAGENESIS OF TYR-20.
REFERENCE 13 (residues 1 to 760)
AUTHORS Do,S.I. and Cummings,R.D.
TITLE Presence of O-linked oligosaccharide on a threonine residue in the
human transferrin receptor
JOURNAL Glycobiology 2 (4), 345-353 (1992)
MEDLINE 93043836
REMARK CARBOHYDRATE-LINKAGE SITES THR-104.
REFERENCE 14 (residues 1 to 760)
AUTHORS Hayes,G.R., Enns,C.A. and Lucas,J.J.
TITLE Identification of the O-linked glycosylation site of the human
transferrin receptor
JOURNAL Glycobiology 2 (4), 355-359 (1992)
MEDLINE 93043837
REMARK CARBOHYDRATE-LINKAGE SITES THR-104.
REFERENCE 15 (residues 1 to 760)
AUTHORS Hayes,G.R., Williams,A., Costello,C.E., Enns,C.A. and
Lucas,J.J.
TITLE The critical glycosylation site of human transferrin receptor
contains a high-mannose oligosaccharide
JOURNAL Glycobiology 5 (2), 227-232 (1995)
MEDLINE 95299226
REMARK STRUCTURE OF CARBOHYDRATES ON ASN-727.
REFERENCE 16 (residues 1 to 760)
AUTHORS Buchegger,F., Trowbridge,I.S., Liu,L.F., White,S. and Collawn,J.F.
TITLE Functional analysis of human/chicken transferrin receptor chimeras
indicates that the carboxy-terminal region is important for ligand
binding
JOURNAL Eur. J. Biochem. 235 (1-2), 9-17 (1996)
MEDLINE 96202913
REMARK IDENTIFICATION OF LIGAND BINDING DOMAIN.
REFERENCE 17 (residues 1 to 760)
AUTHORS Collawn,J.F., Lai,A., Domingo,D., Fitch,M., Hatton,S. and
Trowbridge,I.S.
TITLE YTRF is the conserved internalization signal of the transferrin
receptor, and a second YTRF signal at position 31-34 enhances
endocytosis
JOURNAL J. Biol. Chem. 268 (29), 21686-21692 (1993)
MEDLINE 94012749
REMARK MUTAGENESIS OF ENDOCYTOSIS SITE.
REFERENCE 18 (residues 1 to 760)
AUTHORS Alvarez,E., Girones,N. and Davis,R.J.
TITLE A point mutation in the cytoplasmic domain of the transferrin
receptor inhibits endocytosis
JOURNAL Biochem. J. 267 (1), 31-35 (1990)
MEDLINE 90226333
REMARK MUTAGENESIS OF TYR-20.
REFERENCE 19 (residues 1 to 760)
AUTHORS Dubljevic,V., Sali,A. and Goding,J.W.
TITLE A conserved RGD (Arg-Gly-Asp) motif in the transferrin receptor is
required for binding to transferrin
JOURNAL Biochem. J. 341 (Pt 1), 11-14 (1999)
MEDLINE 99306849
REMARK MUTAGENESIS OF RGD MOTIF.
REFERENCE 20 (residues 1 to 760)
AUTHORS West,A.P. Jr., Giannetti,A.M., Herr,A.B., Bennett,M.J.,
Nangiana,J.S., Pierce,J.R., Weiner,L.P., Snow,P.M. and
Bjorkman,P.J.
TITLE Mutational analysis of the transferrin receptor reveals overlapping
HFE and transferrin binding sites
JOURNAL J. Mol. Biol. 313 (2), 385-397 (2001)
MEDLINE 21659940
REMARK MUTAGENESIS.
REFERENCE 21 (residues 1 to 760)
AUTHORS Fuchs,H., Lucken,U., Tauber,R., Engel,A. and Gessner,R.
TITLE Structural model of phospholipid-reconstituted human transferrin
receptor derived by electron microscopy
JOURNAL Structure 6 (10), 1235-1243 (1998)
MEDLINE 98455506
REMARK ELECTRON MICROSCOPY.
REFERENCE 22 (residues 1 to 760)
AUTHORS Lawrence,C.M., Ray,S., Babyonyshev,M., Galluser,R., Borhani,D.W.
and Harrison,S.C.
TITLE Crystal structure of the ectodomain of human transferrin receptor
JOURNAL Science 286 (5440), 779-782 (1999)
MEDLINE 20002984
REMARK X-RAY CRYSTALLOGRAPHY (3.2 ANGSTROMS) OF 121-760.
REFERENCE 23 (residues 1 to 760)
AUTHORS Douabin-Gicquel,V., Soriano,N., Ferran,H., Wojcik,F., Palierne,E.,
Tamim,S., Jovelin,T., McKie,A.T., Le Gall,J.-Y., David,V. and
Mosser,J.
TITLE Identification of 96 single nucleotide polymorphisms in eight genes
involved in iron metabolism: efficiency of bioinformatic extraction
compared with a systematic sequencing approach
JOURNAL Hum. Genet. 109 (4), 393-401 (2001)
MEDLINE 21558434
REMARK VARIANT GLY-142.
COMMENT This SWISS-PROT entry is copyright. It is produced through a
collaboration between the Swiss Institute of Bioinformatics and
the EMBL outstation - the European Bioinformatics Institute.
The original entry is available from http://www.expasy.ch/sprot
and http://www.ebi.ac.uk/sprot
[FUNCTION] Cellular uptake of iron occurs via receptor-mediated
endocytosis of ligand-occupied transferrin receptor into
specialized endosomes. Endosomal acidification leads to iron
release. The apotransferrin-receptor complex is then recycled to
the cell surface with a return to neutral pH and the concomitant
loss of affinity of apotransferrin for its receptor. Transferrin
receptor is necessary for development of erythrocytes and the
nervous system (By similarity). A second ligand, the hereditary
hemochromatosis protein HFE, competes for binding with transferrin
for an overlapping C-terminal binding site.
[SUBUNIT] Homodimer; disulfide-linked. Binds one transferrin or HFE
molecule per polypeptide chain. Binds the HLA class II
histocompatibility antigen, DR1.
[SUBCELLULAR LOCATION] Type II membrane protein. The sTfR isoform
is secreted.
[INDUCTION] Regulated by cellular iron levels through binding of
the iron regulatory proteins, IRP1 and IRP2, to iron-responsive
elements in the 3'-UTR. Up-regulated upon mitogenic stimulation.
[PTM] N- and O-glycosylated, phosphorylated and palmitoylated. The
serum form is only glycosylated.
[PTM] Proteolytically cleaved on Arg-100 to produce the soluble
serum form (sTfR).
[MISCELLANEOUS] Serum transferrin receptor (sTfR) is used as a
means of detecting erythropoietin (EPO) misuse by athletes and as a
diagnostic test for anemias resulting from a number of conditions
including rheumatoid arthritis, pregnancy, irritable bowel syndrome
and in HIV patients.
[MISCELLANEOUS] Canine and feline parvoviruses bind human and
feline transferrin receptors and use these receptors to enter and
infect cells.
[SIMILARITY] BELONGS TO PEPTIDASE FAMILY M28B.
[DATABASE] NAME=PROW; NOTE=CD guide CD71 entry;
WWW="http://www.ncbi.nlm.nih.gov/prow/cd/cd71.htm".



FEATURES Location/Qualifiers
source 1..760
/organism="Homo sapiens"
/db_xref="taxon:9606"
gene 1..760
/gene="TFRC"
Protein 1..760
/gene="TFRC"
/product="Transferrin receptor protein 1"
Region 1..67
/gene="TFRC"
/region_name="Domain"
/note="CYTOPLASMIC (POTENTIAL)."
Site 9..12
/gene="TFRC"
/site_type="mutagenized"
/note="FSNL->YTRF: ONLY 80% AS ACTIVE AS WILD-TYPE
RECEPTOR. UPTAKE OF THE RECEPTOR."
Site 20..34
/gene="TFRC"
/site_type="mutagenized"
/note="YTRFSLARQVDGDNS->PPGYSLARQVDYTRF: NO INFLUENCE ON
ENDOCYTIC UPTAKE OF THE RECEPTOR."
Site 20..23
/gene="TFRC"
/site_type="mutagenized"
/note="YTRF->PPGY: ONLY 16% AS ACTIVE AS WILD-TYPE
RECEPTOR."
Site 20..23
/gene="TFRC"
/site_type="unclassified"
/note="ENDOCYTOSIS SIGNAL."
Site 20
/gene="TFRC"
/site_type="mutagenized"
/note="Y->C: ONLY 35% AS ACTIVE AS WILD-TYPE RECEPTOR."
Site 20
/gene="TFRC"
/site_type="mutagenized"
/note="Y->G: ONLY 20% AS ACTIVE AS WILD-TYPE RECEPTOR."
Site 21
/gene="TFRC"
/site_type="mutagenized"
/note="T->TA: ONLY 14% AS ACTIVE AS WILD-TYPE RECEPTOR."
Site 21
/gene="TFRC"
/site_type="mutagenized"
/note="T->TAA: ONLY 19% AS ACTIVE AS WILD-TYPE RECEPTOR."
Site 21
/gene="TFRC"
/site_type="mutagenized"
/note="T->F: ONLY 88% AS ACTIVE AS WILD-TYPE RECEPTOR."
Site 23
/gene="TFRC"
/site_type="mutagenized"
/note="F->Y: ONLY 48% AS ACTIVE AS WILD-TYPE RECEPTOR."
Site 24
/gene="TFRC"
/site_type="phosphorylation"
Site 31..34
/gene="TFRC"
/site_type="mutagenized"
/note="GDNS->YTRF: 2-FOLD INCREASE OF THE ENDOCYTIC UPTAKE
OF THE RECEPTOR."
Site 47..50
/gene="TFRC"
/site_type="mutagenized"
/note="NADN->YTRF: 1.27-FOLD INCREASE OF THE ENDOCYTIC
UPTAKE OF THE RECEPTOR."
Site 58..61
/gene="TFRC"
/site_type="unclassified"
/note="STOP-TRANSFER SEQUENCE."
Site 62
/gene="TFRC"
/site_type="lipid-binding"
/note="PALMITATE (MAJOR)."
Site 67
/gene="TFRC"
/site_type="lipid-binding"
/note="PALMITATE."
Region 68..88
/gene="TFRC"
/region_name="Transmembrane region"
/note="SIGNAL-ANCHOR (TYPE-II MEMBRANE PROTEIN)
(POTENTIAL)."
Region 89..760
/gene="TFRC"
/region_name="Domain"
/note="EXTRACELLULAR (POTENTIAL)."
Bond bond(89)
/gene="TFRC"
/bond_type="disulfide"
/note="INTERCHAIN."
Bond bond(98)
/gene="TFRC"
/bond_type="disulfide"
/note="INTERCHAIN."
Site 100..101
/gene="TFRC"
/site_type="cleavage"
/note="CLEAVAGE (BY TRYPSIN TO PRODUCE SOLUBLE FORM)."
Region 104
/gene="TFRC"
/region_name="Conflict"
/note="T -> K (IN REF. 7; AA SEQUENCE)."
Site 104
/gene="TFRC"
/site_type="glycosylation"
/note="O-LINKED (GALNAC...). /FTId=CAR_000072."
Region 109
/gene="TFRC"
/region_name="Conflict"
/note="R -> V (IN REF. 7; AA SEQUENCE)."
Region 123
/gene="TFRC"
/region_name="Conflict"
/note="Y -> T (IN REF. 7; AA SEQUENCE)."
Region 142
/gene="TFRC"
/region_name="Variant"
/note="S -> G (RARE POLYMORPHISM). /FTId=VAR_012737."
Site 251
/gene="TFRC"
/site_type="glycosylation"
/note="N-LINKED (GLCNAC...)."
Site 317
/gene="TFRC"
/site_type="glycosylation"
/note="N-LINKED (GLCNAC...)."
Region 569..760
/gene="TFRC"
/region_name="Domain"
/note="LIGAND-BINDING."
Site 619
/gene="TFRC"
/site_type="mutagenized"
/note="L->A: 20-FOLD REDUCED AFFINITY FOR TRANSFERRIN
RECEPTOR. NO BINDING TO HFE."
Site 622
/gene="TFRC"
/site_type="mutagenized"
/note="V->A: NO SIGNIFICANT EFFECT ON BINDING TO
TRANSFERRIN NOR HFE."
Site 623
/gene="TFRC"
/site_type="mutagenized"
/note="R->A: NO SIGNIFICANT EFFECT ON BINDING TO
TRANSFERRIN NOR HFE. TRANSFERRIN. NO BINDING TO HFE."
Site 629
/gene="TFRC"
/site_type="mutagenized"
/note="R->A: >5-FOLD REDUCED AFFINITY FOR TRANSFERRIN.
>10-FOLD REDUCED AFFINITY FOR HFE."
http://ww.ncbi.nlm.nih.^v/^t^gg.7^|^.fjjgjj;^g^ 5/28/2003
Site 640
/gene="TFRC"
/site_type="mutagenized"
/note="Q->A: NO EFFECT ON BINDING TO TRANSFERRIN. >10-FOLD
REDUCED AFFINITY FOR HFE."
Site 641
/gene="TFRC"
/site_type="mutagenized"
/note="W->A: NO SIGNIFICANT EFFECT ON BINDING TO
TRANSFERRIN NOR HFE."
Site 643
/gene="TFRC"
/site_type="mutagenized"
/note="Y->A: 20-FOLD REDUCED AFFINITY FOR TRANSFERRIN. NO
BINDING TO HFE."
Site 644
/gene="TFRC"
/site_type="mutagenized"
/note="S->A: NO SIGNIFICANT EFFECT ON BINDING TO
TRANSFERRIN NOR HFE."
Site 646..648
/gene="TFRC"
/site_type="unclassified"
/note="CELL ATTACHMENT SITE (REQUIRED FOR BINDING TO
TRANSFERRIN)."
Site 646
/gene="TFRC"
/site_type="mutagenized"
/note="R->A, H: NO BINDING TO TRANSFERRIN."
Site 646
/gene="TFRC"
/site_type="mutagenized"
/note="R->K: 5% BINDING TO TRANSFERRIN."
Site 647
/gene="TFRC"
/site_type="mutagenized"
/note="G->A: LARGE EFFECT ON AFFINITY FOR TRANSFERRIN.
4-FOLD REDUCED AFFINITY FOR HFE."
Site 648
/gene="TFRC"
/site_type="mutagenized"
/note="D->E: 57% BINDING TO TRANSFERRIN."
Site 648
/gene="TFRC"
/site_type="mutagenized"
/note="D->A: 16% BINDING TO TRANSFERRIN."
Site 650
/gene="TFRC"
/site_type="mutagenized"
/note="F->Q: >5-FOLD REDUCED AFFINITY FOR TRANSFERRIN.
>10-FOLD REDUCED AFFINITY FOR HFE."
Site 727
/gene="TFRC"
/site_type="glycosylation"
/note="N-LINKED (GLCNAC...). /FTId=CAR_000173."

ORIGIN

1 mmdqarsafs nlfggeplsy trfslarqvd gdnshvemkl avdeeenadn ntkanvtkpk
61 rcsgsicygt iavivfflig fmigylgyck gvepktecer lagtespvre epgedfpaar
121 rlywddlkrk lsekldstdf tstikllnen syvpreagsq kdenlalyve nqfrefklsk
181 vwrdqhfvki qvkdsaqnsv iivdkngrlv ylvenpggyv ayskaatvtg klvhanfgtk
241 kdfedlytpv ngsivivrag kitfaekvan aeslnaigvl iymdqtkfpi vnaelsffgh
301 ahlgtgdpyt pgfpsfnhtq fppsrssglp nipvqtisra aaeklfgnme gdcpsdwktd
361 stcrmvtses knvkltvsnv lkeikilnif gvikgfvepd hyvvvgaqrd awgpgaaksg
421 vgtalllkla qmfsdmvlkd gfqpsrsiif aswsagdfgs vgatewlegy lsslhlkaft
481 yinldkavlg tsnfkvsasp llytliektm qnvkhpvtgq flyqdsnwas kvekltldna
541 afpflaysgi pavsfcfced tdypylgttm dtykelieri pelnkvaraa aevagqfvik
601 lthdvelnld yerynsqlls fvrdlnqyra dikemglslq wlysargdff ratsrlttdf
661 gnaektdrfv mkklndrvmr veyhflspyv spkespfrhv fwgsgshtlp allenlklrk
721 qnngafnetl frnqlalatw tiqgaanals gdvwdidnef
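
Feature locations in the record above are 1-based and inclusive, so they can be checked directly against the ORIGIN sequence. The Python sketch below is illustrative only: the helper names origin_to_sequence and slice_feature are hypothetical, and only the first ORIGIN line (residues 1 to 60) is used as input.

```python
import re
from typing import List


def origin_to_sequence(origin_lines: List[str]) -> str:
    """Join ORIGIN lines into one uppercase string, dropping the
    leading position numbers and the spaces between 10-residue blocks."""
    return "".join(re.sub(r"[^A-Za-z]", "", line) for line in origin_lines).upper()


def slice_feature(sequence: str, location: str) -> str:
    """Return the residues for a 1-based inclusive location such as
    '20..23' or a single position such as '62'."""
    parts = location.replace(" ", "").split("..")
    start = int(parts[0])
    end = int(parts[-1])
    return sequence[start - 1:end]


# First ORIGIN line of the P02786 record, as printed above.
FIRST_ORIGIN_LINE = [
    "1 mmdqarsafs nlfggeplsy trfslarqvd gdnshvemkl avdeeenadn ntkanvtkpk",
]

seq = origin_to_sequence(FIRST_ORIGIN_LINE)
print(slice_feature(seq, "20..23"))  # YTRF
print(slice_feature(seq, "31..34"))  # GDNS
print(slice_feature(seq, "9..12"))   # FSNL
print(slice_feature(seq, "47..50"))  # NADN
```

The four slices agree with the wild-type residues named in the corresponding feature notes (YTRF at 20..23, GDNS at 31..34, FSNL at 9..12, NADN at 47..50), which is a quick consistency check on the numbering.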



LOCUS Q9UP52 801 aa linear PRI 15-SEP-2003

DEFINITION Transferrin receptor protein 2 (TfR2). 
ACCESSION Q9UP52 
VERSION Q9UP52 GI: 20140912 

DBSOURCE swissprot: locus TFR2_HUMAN, accession Q9UP52;
class: standard.
extra accessions: O75422, Q9HA99, Q9NX67, created: Feb 28, 2003.
sequence updated: Feb 28, 2003.
annotation updated: Sep 15, 2003.
xrefs: gi: 5596369, gi: 5596370, gi: 3135305, gi: 3135312, gi:
10433313, gi: 10433314, gi: 7020497, gi: 7020498
xrefs (non-sequence databases): HSSPP02786, MEROPSM28.973,
GenewHGNC:11762, MIM 604720, MIM 604250, GOGO:0005887,
GOGO:0004998, GOGO:0006826, InterProIPR003137, PfamPF02225,
PfamPF04389, PfamPF04253
KEYWORDS Transmembrane; Glycoprotein; Receptor; Signal-anchor; Alternative 
splicing; Disease mutation. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (residues 1 to 801) 

Kawabata,H., Yang,R., Hirama,T., Vuong,P.T., Kawano,S., 
Gombart,A.F. and Koef f ler, H . P . 

Molecular cloning of transferrin receptor 2. A new member of the 
transferrin receptor-like family 
J. Biol. Chem. 274 (30), 20826-20832 (1999) 
99340005 
10409623 

SEQUENCE FROM N.A. (ISOFORMS ALPHA AND BETA) . 
TISSUE=Erythroleukemia, and Myeloid leukemia cells 

2 (residues 1 to 801) 

Glockner,G., Scherer,S., Schattevoy, R. , Boright,A., Weber, J., 
Tsui,L.C. and Rosenthal, A. 

Large-scale sequencing of two regions in human chromosome 7q22: 
analysis of 650 kb of genomic sequence around the EPO and CUTLl 
loci reveals 17 genes 
Genome Res. 8 (10), 1060-1073 (1998) 
99018118 
9799793 

SEQUENCE FROM N.A. (ISOFORM GAMMA) . 

3 (residues 1 to 801) 

Isogai,T., Ota,T., Hayashi,K., Sugiyama,T., Otsuki,T., Suzuki, Y., 
Nishikawa, T. , Nagai,K., Sugano,S., Shiratori, A. , Sudo,H., 
Wagatsuma,M. , Hosoiri,T., Kaku,Y., Kodaira,H., Kondo,H., 
Sugawara,M., Takahashi , M . , Chiba,Y., Ishida,S., Murakawa,K., 
Ono,Y., Takiguchi,S. , Watanabe,S., Kimura,K., Murakami, K., 



SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 



JOURNAL 
MEDLINE 
PUBMED 
REMARK 
REFERENCE 
AUTHORS . 
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TITLE 

JOURNAL 

RE^4ARK 

REFERENCE 
AUTHORS 



JOURNAL 
MEDLINE 
■PUBMED 
REMARK 
REFERENCE 
AUTHORS 



JOURNAL 
MEDLINE 
PUBMED 
REMARK 
COMMENT 



Yamamoto, J. , Wakamatsu, A. , 
Masuho,Y., Ninomiya,K. and Iwayanagi,T. 



Ishii,S., Kawai,Y., Saito,K., 
Nakamura, Y . , Nagahari,K., 
Direct Submission 
Submitted (~AUG-2000) 

SEQUENCE OF 1-158 AND 370-801 FROM N.A. 
TISSUE=Carcinoma, and Embryo 

4 (residues 1 to 801) 

Camaschella, C. , Roetto,A., Cali,A., De Gobbi,M., Garozzo,G., 
Carella,M., Majorano,N., Totaro,A. and Gasparini, P . 

The gene fFR2 is mutated in a new type of haemochromatosis mapping 
to 7q22 

Nat. Genet. 25 (1), 14-15 (2000) 

20264357 

10802645 

DISEASE. 

5 (residues 1 to 801) 

Roetto,A., Totaro,A., Piperno,A., Piga,A. , Longo,F., Garozzo,G., 

Cali,A., De Gobbi,M., Gasparini, P. and Camaschella, C. 

New mutations inactivating transferrin receptor 2 in 

hemochromatosis type 3 

Blood 97 (9), 2555-2560 (2001) 

21213521 

1131324L 

VARIANT HFE3 LYS-172. 

This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http://www.expasv.ch/sprot 
and http: //www. ebi . ac.uk/sprot 

[FUNCTION] Mediates cellular uptake of transf errin-bound iron in a 
non-iron dependent manner. May be involved in iron metabolism, 
hepatocyte function and erythrocyte differentiation. 
[SUBUNIT] Homodimer. 

[SUBCELLULAR LOCATION] Type II membrane protein. The beta isoform 
lacks the transmembrane domain and is probably intracellular. 
[ALTERNATIVE PRODUCTS] Event=Alternative splicing; Named 
isoforms=3; Name=Alpha; IsoId=Q9UP52-l ; Sequence=Displayed; 
Name=Beta; IsoId=Q9UP52-2 ; Sequence=VSP_005354 ; Name=Gamma; 
IsoId=Q9UP52-3; Sequence=VSP_005355 . 

[TISSUE SPECIFICITY] Predominantly expressed in liver. While the 
alpha form is also expressed in spleen, lung, muscle, prostate and 
peripheral blood mononuclear cells, the beta form is expressed in 
all tissues tested, albeit weakly. 

[DISEASE] Defects in TFR2 are' a cause of hereditary hemochromatosis 
type 3 (HFE3) [MIM: 604250] . HFE3 is a disorder of iron hemostasis 
resulting in iron overload and has a phenotype indistinguishable 
from that of hereditary hemochromatosis (HH) . HH is characterized 
by abnormal intestinal iron absorption and progressive increase of 
total body iron, which results in midlife in clinical complications 
including cirrhosis, cardiopathy, diabetes, endocrine dysfunctions, 
arthropathy, and susceptibility to liver cancer. Since the disease 
complications can be effectively prevented by regular phlebotomies, 
early diagnosis is most important to provide a normal life 
expectancy to the affected subjects. 

[MISCELLANEOUS] The variant lys-172 found in hereditary 
hemochromatosis type III affects the putative initiation codon of 
the beta isoform thus preventing its translation. 
[SIMILARITY] BELONGS TO PEPTIDASE FAMILY M28B. 
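
The [MISCELLANEOUS] note can be checked against the sequence printed in this entry. The feature table lists residues 1..171 as missing in isoform Beta, so residue 172 would be the first residue of that isoform, and the HFE3 variant replaces that residue (M -> K). The following is a minimal Python sketch under that reading; the constant and helper names are illustrative only, and the residues are transcribed from the first three ORIGIN lines of this record.

```python
# Residues 1-180 of the TFR2 record, transcribed from the ORIGIN block of
# this entry with position numbers and spaces removed (an assumption of
# this sketch: the transcription is faithful to the printed record).
TFR2_1_180 = (
    "merlwglfqraqqlsprssqtvyqrvegprkghleeeeedgeegaetlahfcpmelrgpe"
    "plgsrprqpnlipwaaagrraapylvltalliftgafllgyvafrgscqacgdsvlvvse"
    "dvnyepdldfhqgrlywsdlqamflqflgegrledtirqtslrervagsagmaaltqdir"
)

BETA_MISSING_END = 171  # feature table: 1..171 missing in isoform Beta
VARIANT_POS = 172       # feature table: M -> K (in HFE3)


def residue_at(seq: str, position: int) -> str:
    """Return the residue at a 1-based position, as used in feature tables."""
    return seq[position - 1].upper()


beta_start = BETA_MISSING_END + 1
print(beta_start == VARIANT_POS)           # the variant sits on the Beta start
print(residue_at(TFR2_1_180, beta_start))  # residue that would begin isoform Beta
```

If the residue printed is a methionine, the record is internally consistent with the statement that the variant affects the putative initiation codon of the beta isoform.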
FEATURES             Location/Qualifiers
     source          1..801
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..801
                     /gene="TFR2"
     Protein         1..801
                     /gene="TFR2"
                     /product="Transferrin receptor pr
     Region          1..171
                     /gene="TFR2"
                     /region_name="Splicing variant"
                     /note="Missing (in isoform Beta). /FTId=VSP_005354."
     Region          1..83
                     /gene="TFR2"
                     /region_name="Domain"
                     /note="CYTOPLASMIC (POTENTIAL)."
     Site            23..26
                     /gene="TFR2"
                     /site_type="unclassified"
                     /note="ENDOCYTOSIS SIGNAL (POTENTIAL)."
     Region          84..104
                     /gene="TFR2"
                     /region_name="Transmembrane region"
                     /note="SIGNAL-ANCHOR (TYPE-II MEMBRANE PROTEIN)
                     (POTENTIAL)."
     Region          105..801
                     /gene="TFR2"
                     /region_name="Domain"
                     /note="EXTRACELLULAR (POTENTIAL)."
     Bond            bond(108)
                     /gene="TFR2"
                     /bond_type="disulfide"
                     /note="INTERCHAIN (POTENTIAL)."
     Bond            bond(111)
                     /gene="TFR2"
                     /bond_type="disulfide"
                     /note="INTERCHAIN (POTENTIAL)."
     Region          172
                     /gene="TFR2"
                     /region_name="Variant"
                     /note="M -> K (IN HFE3). /FTId=VAR_012738."
     Site            240
                     /gene="TFR2"
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...) (POTENTIAL)."
     Site            339
                     /gene="TFR2"
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...) (POTENTIAL)."
     Region          343..369
                     /gene="TFR2"
                     /region_name="Splicing variant"
                     /note="Missing (in isoform Gamma). /FTId=VSP_005355."
     Site            540
                     /gene="TFR2"
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...) (POTENTIAL)."
     Region          712
                     /gene="TFR2"
                     /region_name="Conflict"
                     /note="R -> RIPLSAQV (IN REF. 2)."
     Site            754
                     /gene="TFR2"
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...) (POTENTIAL)."

ORIGIN 

1 merlwglfqr aqqlsprssq tvyqrvegpr kghleeeeed geegaetlah fcpmelrgpe 
61 plgsrprqpn lipwaaagrr aapylvltal liftgafllg yvafrgscqa cgdsvlvvse 
121 dvnyepdldf hqgrlywsdl qamflqflge grledtirqt slrervagsa gmaaltqdir 
181 aalsrqkldh vwtdthyvgl qfpdpahpnt lhwvdeagkv geqlpledpd vycpysaign 
241 vtgelvyahy grpedlqdlr argvdpvgrl llvrvgvisf aqkvtnaqdf gaqgvliype 
301 padfsqdppk pslssgqavy ghvhlgtgdp ytpgfpsfnq tqfppvassg lpsipaqpis 
361 adiasrllrk lkgpvapqew qgsllgspyh lgpgprlrlv vnnhrtstpi nnifgciegr 
421 sepdhyvvig aqrdawgpga aksavgtail lelvrtfssm vsngfrprrs llfiswdggd 
481 fgsvgstewl egylsvlhlk avvyvsldna vlgddkfhak tsplltslie svlkqvdspn 
541 hsgqtlyeqv vftnpswdae virplpmdss aysftafvgv pavefsfmed dqaypflhtk 
601 edtyenlhkv lqgrlpavaq avaqlagqll irlshdrllp ldfgrygdvv lrhignlnef 
661 sgdlkarglt lqwvysargd yiraaeklrq eiysseerde rltrmynvri mrvefyflsq 
721 yvspadspfr hifmgrgdht lgalldhlrl lrsnssgtpg atsstgfqes rfrrqlallt 
781 wtlqgaanal sgdvwnidnn f 

// 
Blast 2 Sequences results 

BLAST 2 SEQUENCES RESULTS VERSION BLASTP 2.2.5 [Nov-16-2002] 

Matrix: BLOSUM62   gap open: 11   gap extension: 1 
x_dropoff: 50   expect: 10.000   wordsize: 3   Filter 

Sequence 1 lcl|seq_1 Length 560 
Sequence 2 lcl|seq_2 Length 760 
No significant similarity was found 

Blast 2 Sequences results 

BLAST 2 SEQUENCES RESULTS VERSION BLASTP 2.2.5 [Nov-16-2002] 

Matrix: BLOSUM62   gap open: 11   gap extension: 1 
x_dropoff: 50   expect: 10.000   wordsize: 3   Filter 

Sequence 1 lcl|seq_1 Length 560 
Sequence 2 lcl|seq_2 Length 801 
No significant similarity was found 
1: NP_066563. neuromedin B [Hom...[gi:10835107] 

LOCUS       NMB          121 aa          linear   PRI 06-APR-2003 
DEFINITION  neuromedin B [Homo sapiens]. 
ACCESSION   NP_066563 
VERSION     NP_066563.1 GI:10835107 
DBSOURCE    REFSEQ: accession NM_021077 
KEYWORDS 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1 (residues 1 to 121) 
  AUTHORS   Richard,I., Broux,O., Chiannilkulchai,N., Fougerousse,F., 
            Allamand,V., Bourg,N., Brenguier,L., Devaud,C., Pasturaud, 
            Roudaut,C. et al. 
  TITLE     Regional localization of human chromosome 15 loci 
  JOURNAL   Genomics 23 (3), 619-627 (1994) 
  MEDLINE   95154832 
  PUBMED    7851890 
REFERENCE   2 (residues 1 to 121) 
  AUTHORS   Krane,I.M., Naylor,S.L., Helin-Davis,D., Chin,W.W. and 
            Spindel,E.R. 
  TITLE     Molecular cloning of cDNAs encoding the human bombesin-like peptide 
            neuromedin B. Chromosomal localization and comparison to cDNAs 
            encoding its amphibian homolog ranatensin 
  JOURNAL   J. Biol. Chem. 263 (26), 13317-13323 (1988) 
  MEDLINE   88330837 
  PUBMED    2458345 
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final 
            NCBI review. The reference sequence was derived from M21551.1. 
FEATURES             Location/Qualifiers 
     source          1..121 
                     /organism="Homo sapiens" 
                     /db_xref="taxon:9606" 
                     /chromosome="15" 
                     /map="15q22-qter" 
                     /tissue_type="hypothalamus" 
                     /tissue_lib="of R.Goodman" 
     Protein         1..121 
                     /product="neuromedin B" 
     sig_peptide     1..24 
                     /gene="NMB" 
                     /note="G00-120-237" 
     mat_peptide     25..57 
                     /gene="NMB" 
                     /product="neuromedin B" 
                     /note="G00-120-237" 
     CDS             1..121 
                     /gene="NMB" 
                     /coded_by="NM_021077.2:37..402" 
                     /db_xref="LocusID:4828" 
                     /db_xref="MIM:162340" 
ORIGIN 
        1 marraggarm fgslllfall aagvaplswd lpeprsrask irvhsrgnlw atghfmgkks 
       61 lepsspshwg qlptpplrdq rlqlshdllg illlkkalgv slsrpapqiq yrrllvqilq 
      121 k 
// 
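
An ORIGIN block such as the one in the neuromedin B record can be turned into a plain residue string before it is pasted into a comparison tool. The following is a minimal Python sketch, assuming the three ORIGIN lines of that record (with the capital "I" at the start of the fourth block read as a lowercase "l", since the rest of the block is lowercase); the function name `origin_to_sequence` is illustrative and not part of any NCBI tool.

```python
import re

# ORIGIN lines of the NP_066563 record, transcribed from this printout.
ORIGIN_BLOCK = """
        1 marraggarm fgslllfall aagvaplswd lpeprsrask irvhsrgnlw atghfmgkks
       61 lepsspshwg qlptpplrdq rlqlshdllg illlkkalgv slsrpapqiq yrrllvqilq
      121 k
"""


def origin_to_sequence(block: str) -> str:
    """Strip position numbers and whitespace from an ORIGIN block."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


nmb = origin_to_sequence(ORIGIN_BLOCK)
print(len(nmb))    # should agree with the "121 aa" stated on the LOCUS line
print(nmb[24:57])  # residues 25..57, the mat_peptide range in the feature table
```

Checking the length against the LOCUS line is a cheap way to catch a dropped or duplicated residue in a transcription.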
1: NP_002501. glycoprotein (tra...[gi:4505405] 

LOCUS       GPNMB        560 aa          linear   PRI 05-APR-2003 
DEFINITION  glycoprotein (transmembrane) nmb; transmembrane glycoprotein [Homo 
            sapiens]. 
ACCESSION   NP_002501 
VERSION     NP_002501.1 GI:4505405 
DBSOURCE    REFSEQ: accession NM_002510.1 
KEYWORDS 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1 (residues 1 to 560) 
  AUTHORS   Weterman,M.A., Ajubi,N., van Dinter,I.M., Degen,W.G., van 
            Muijen,G.N., Ruitter,D.J. and Bloemers,H.P. 
  TITLE     nmb, a novel gene, is expressed in low-metastatic human melanoma 
            cell lines and xenografts 
  JOURNAL   Int. J. Cancer 60 (1), 73-81 (1995) 
  MEDLINE   95113576 
  PUBMED    7814155 
COMMENT     REVIEWED REFSEQ: This record has been curated by NCBI staff. The 
            reference sequence was derived from X76534.1. 

            Summary: GPNMB is a transmembrane glycoprotein which shows homology 
            to the pMEL17 precursor, a melanocyte-specific protein. GPNMB shows 
            expression in the lowly metastatic human melanoma cell lines and 
            xenografts but does not show expression in the highly metastatic 
            cell lines; although, it is also expressed in other tissues and 
            tumor cell lines as well. GPNMB may be involved in growth delay and 
            reduction of metastatic potential. 
FEATURES             Location/Qualifiers 
     source          1..560 
                     /organism="Homo sapiens" 
                     /db_xref="taxon:9606" 
                     /chromosome="7" 
                     /map="7p15" 
     Protein         1..560 
                     /product="glycoprotein (transmembrane) nmb" 
                     /note="transmembrane glycoprotein" 
     variation       195 
                     /allele="C" 
                     /allele="S" 
                     /db_xref="dbSNP:530436" 
     variation       197 
                     /allele="H" 
                     /allele="N" 
                     /db_xref="dbSNP:530413" 
     Region          259..319 
                     /region_name="Repeats in polycystic kidney disease 1 
                     (PKD1) and other proteins" 
                     /note="PKD" 
                     /db_xref="CDD:smart00089" 
     CDS             1..560 
                     /gene="GPNMB" 
                     /coded_by="NM_002510.1:92..1774" 
                     /db_xref="LocusID:10457" 
                     /db_xref="MIM:604368" 
ORIGIN 
        1 meclyyflgf lllaarlpld aakrfhdvlg nerpsaymre hnqlngwssd endwneklyp 
       61 vwkrgdmrwk nswkggrvqa vltsdspalv gsnitfavnl ifprcqkeda ngnivyeknc 
      121 rneaglsadp yvynwtawse dsdgengtgq shhnvfpdgk pfphhpgwrr wnfiyvfhtl 
      181 gqyfqklgrc svrvsvntan vtlgpqlmev tvyrrhgray vpiaqvkdvy vvtdqipvfv 
      241 tmfqkndrns sdetflkdlp imfdvlihdp shflnystin ykwsfgdntg lfvstnhtvn 
      301 htyvlngtfs lnltvkaaap gpcppppppp rpskptpslg pagdnplels ripdencqin 
      361 ryghfqatit ivegilevni iqmtdvlmpv pwpesslidf vvtcqgsipt evctiisdpt 
      421 ceitqntvcs pvdvdemcll tvrrtfngsg tycvnltlgd dtslaltstl isvpdrdpas 
      481 plrmansali svgclaifvt visllvykkh keynpiensp gnvvrskgls vflnrakavf 
      541 fpgnqekdpl lknqefkgvs 
// 






http://www.ncbi.nliii.nili.t^^/p^^gpy^^^.fj^^^g^6^^ 4/22/2003 



Blast 2 Sequences results 

BLAST 2 SEQUENCES RESULTS VERSION BLASTP 2.2.5 [Nov-16-2002] 

Matrix: BLOSUM62   gap open: 11   gap extension: 1 
x_dropoff: 50   expect: 10.000   wordsize: 3   Filter 

Sequence 1 lcl|seq_1 Length 121 
Sequence 2 lcl|seq_2 Length 560 
No significant similarity was found 




http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi? 

Copied from 10030861 on 03/21/2006 



5/28/2003 



RID=1051035808-05395-3586 



results of BLAST 

BLASTP 2.2.6 [Apr-09-2003] 

Reference : 

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs". Nucleic Acids Res. 25:3389-3402. 

RID: 1051035808-05395-3586 

Query= 

(121 letters) 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,415,660 sequences; 455,667,871 total letters 






Distribution of 18 Blast Hits on the Query Sequence 

                                                                  Score    E 
Sequences producing significant alignments:                       (bits)  Value 

gi|10835107|ref|NP_066563.1| neuromedin B [Homo sapiens] >g...     149    9e-36 
gi|20141490|sp|P08949|NEUB_HUMAN Neuromedin B-32 precursor ...     131    2e-30 
gi|13938517|gb|AAH07407.1|AAH07407 Similar to neuromedin B ...     118    2e-26 
gi|88253|pir||A28945 neuromedin B precursor - human                103    4e-22 
gi|13386018|ref|NP_080799.1| RIKEN cDNA 3110023K12 [Mus mus...     92.8   9e-19 
gi|112182|pir||A37178 neuromedin B precursor - rat                 90.5   5e-18 
gi|128094|sp|P01297|NEUB_PIG Neuromedin B-32 [Contains: Neu...     52.4   1e-06 
gi|121641|sp|P29007|GRP_BOMOR Gastrin-releasing peptide pre...     35.4   0.18 
gi|1237140|gb|AAC59785.1| SAP bombesin preprohormone               34.3   0.36 
gi|9049482|gb|AAF82387.1|AF111028_1 gastrin-releasing pepti...     33.9   0.52 
gi|422599|pir||A47201 bombesinlike peptide - African clawed...     33.5   0.63 
gi|1171692|sp|P43443|NEUB_XENLA Neuromedin B precursor >gi|...     33.5   0.69 
gi|1237138|gb|AAC59784.1| Phe-13 bombesin preprohormone >gi...     33.5   0.72 
gi|115097|sp|P01296|BOMB_BOMVA Bombesin precursor >gi|25763...     31.6   2.3 
gi|115096|sp|P21591|BOMB_BOMOR Bombesin precursor >gi|10397...     31.2   2.8 
gi|87496|pir||A26182 gastrin-releasing peptide precursor sp...     30.4   5.0 
gi|1346185|sp|P47851|GRP_SHEEP Gastrin-releasing peptide pr...     30.0   7.8 
gi|2506229|sp|P08947|LITP_PHYSA [Phe8]-phyllolitorin precur...     29.6   8.6 

Alignments 



>gi|10835107|ref|NP_066563.1| neuromedin B [Homo sapiens] 
 gi|189228|gb|AAA59934.1| neuromedin B 
          Length = 121 

 Score = 149 bits (376), Expect = 9e-36 
 Identities = 90/121 (74%), Positives = 90/121 (74%) 

Query: 1   MARRAGGARMFGSXXXXXXXXXXXXXXSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 
           MARRAGGARMFGS              SWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 
Sbjct: 1   MARRAGGARMFGSLLLFALLAAGVAPLSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 

Query: 61  LEPSSPSHWGQLPTPPLRDQRXXXXXXXXXXXXXXXXXGVSLSRPAPQIQYRRLLVQILQ 120 
           LEPSSPSHWGQLPTPPLRDQR                 GVSLSRPAPQIQYRRLLVQILQ 
Sbjct: 61  LEPSSPSHWGQLPTPPLRDQRLQLSHDLLGILLLKKALGVSLSRPAPQIQYRRLLVQILQ 120 

Query: 121 K 121 
           K 
Sbjct: 121 K 121 
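
The first hit is the query's own database record, yet the report shows 90/121 identities rather than 121/121. The X runs in the Query lines are residues masked by the low-complexity filter, and masked columns are not counted as matches. The following is a minimal Python sketch of that count, assuming the two aligned strings as displayed above; `count_identities` is an illustrative helper, not BLAST's own code, and the use of integer division for the percentage is an assumption that happens to agree with the percentages printed in this report.

```python
def count_identities(query: str, sbjct: str) -> tuple[int, int]:
    """Count aligned columns where query and subject residues are identical.

    Columns where the query is masked ('X') or holds a gap ('-') are never
    counted. Returns (identities, alignment_length).
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    identical = sum(
        1 for q, s in zip(query, sbjct) if q == s and q not in "X-"
    )
    return identical, len(query)


sbjct = (
    "MARRAGGARMFGSLLLFALLAAGVAPLSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS"
    + "LEPSSPSHWGQLPTPPLRDQRLQLSHDLLGILLLKKALGVSLSRPAPQIQYRRLLVQILQ"
    + "K"
)
# Query as displayed: the same sequence with two filtered runs shown as X.
query = (
    "MARRAGGARMFGS" + "X" * 14 + "SWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS"
    + "LEPSSPSHWGQLPTPPLRDQR" + "X" * 17 + "GVSLSRPAPQIQYRRLLVQILQ"
    + "K"
)

identities, length = count_identities(query, sbjct)
print(f"Identities = {identities}/{length} ({100 * identities // length}%)")
```

The 31 masked columns (14 + 17) account for the whole difference between 121 and 90.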



>gi|20141490|sp|P08949|NEUB_HUMAN Neuromedin B-32 precursor [Contains: Neuromedi 
 gi|14250345|gb|AAH08603.1|AAH08603 neuromedin B [Homo sapiens] 
          Length = 121 

 Score = 131 bits (329), Expect = 2e-30 
 Identities = 82/121 (67%), Positives = 82/121 (67%) 

Query: 1   MARRAGGARMFGSXXXXXXXXXXXXXXSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 
           MARRAGGARMFGS              SWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 
Sbjct: 1   MARRAGGARMFGSLLLFALLAAGVAPLSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 

Query: 61  LEPSSPSHWGQLPTPPLRDQRXXXXXXXXXXXXXXXXXGVSLSRPAPQIQYRRLLVQILQ 120 
           LEPSSPS  G      LRDQR                 GVSLSRPAPQIQYRRLLVQILQ 
Sbjct: 61  LEPSSPSPLGTATHTSLRDQRLQLSHDLLGILLLKKALGVSLSRPAPQIQYRRLLVQILQ 120 

Query: 121 K 121 
           K 
Sbjct: 121 K 121 


>gi|13938517|gb|AAH07407.1|AAH07407 Similar to neuromedin B [Homo sapiens] 
 gi|13938561|gb|AAH07431.1|AAH07431 Similar to neuromedin B [Homo sapiens] 
          Length = 154 

 Score = 118 bits (295), Expect = 2e-26 
 Identities = 71/110 (64%), Positives = 71/110 (64%) 

Query: 1   MARRAGGARMFGSXXXXXXXXXXXXXXSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 
           MARRAGGARMFGS              SWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 
Sbjct: 1   MARRAGGARMFGSLLLFALLAAGVAPLSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 

Query: 61  LEPSSPSHWGQLPTPPLRDQRXXXXXXXXXXXXXXXXXGVSLSRPAPQIQ 110 
           LEPSSPS  G      LRDQR                 GVSLSRPAPQIQ 
Sbjct: 61  LEPSSPSPLGTATHTSLRDQRLQLSHDLLGILLLKKALGVSLSRPAPQIQ 110 



>gi|88253|pir||A28945 neuromedin B precursor - human 
          Length = 76 

 Score = 103 bits (258), Expect = 4e-22 
 Identities = 62/76 (81%), Positives = 62/76 (81%) 

Query: 1   MARRAGGARMFGSXXXXXXXXXXXXXXSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 
           MARRAGGARMFGS              SWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 
Sbjct: 1   MARRAGGARMFGSLLLFALLAAGVAPLSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 

Query: 61  LEPSSPSHWGQLPTPP 76 
           LEPSSPSHWGQLPTPP 
Sbjct: 61  LEPSSPSHWGQLPTPP 76 



>gi|13386018|ref|NP_080799.1| RIKEN cDNA 3110023K12 [Mus musculus] 
 gi|20139148|sp|Q9CR53|NEUB_MOUSE Neuromedin B-32 precursor [Contains: Neuromedin 
 gi|12848356|dbj|BAB27922.1| unnamed protein product [Mus musculus] 
 gi|12851724|dbj|BAB29144.1| unnamed protein product [Mus musculus] 
 gi|20306611|gb|AAH28490.1| RIKEN cDNA 3110023K12 [Mus musculus] 
          Length = 121 

 Score = 92.8 bits (229), Expect = 9e-19 
 Identities = 61/121 (50%), Positives = 68/121 (56%) 

Query: 1   MARRAGGARMFGSXXXXXXXXXXXXXXSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 
           M R+AG + +                 +WDLPEPRSRASKIRVH RGNLWATGHFMGKKS 
Sbjct: 1   MTRQAGSSWLLRGLLLFALFASGVAPFNWDLPEPRSRASKIRVHPRGNLWATGHFMGKKS 60 

Query: 61  LEPSSPSHWGQLPTPPLRDQRXXXXXXXXXXXXXXXXXGVSLSRPAPQIQYRRLLVQILQ 120 
           LEP S S  G  P    RDQR                 G++ S PAP IQYRRLL  +LQ 
Sbjct: 61  LEPPSLSLVGTAPPNTPRDQRLQLSHDLLRILLRKEALGMNFSGPAPPIQYRRLLEPLLQ 120 

Query: 121 K 121 
           K 
Sbjct: 121 K 121 


>gi|112182|pir||A37178 neuromedin B precursor - rat 
          Length = 117 

 Score = 90.5 bits (223), Expect = 5e-18 
 Identities = 59/115 (51%), Positives = 64/115 (55%) 

Query: 1   MARRAGGARMFGSXXXXXXXXXXXXXXSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS 60 
           M R+AG   +                 SWDLPEPRSRASKIRVH RGNLWATGHFMGKKS 
Sbjct: 1   MTRQAGSTWLLRGLLLFALFVSGITPFSWDLPEPRSRASKIRVHPRGNLWATGHFMGKKS 60 

Query: 61  LEPSSPSHWGQLPTPPLRDQRXXXXXXXXXXXXXXXXXGVSLSRPAPQIQYRRLL 115 
           LEP S S  G  P    R+QR                 G++LS PAP IQYRRLL 
Sbjct: 61  LEPPSLSLVGTAPPITQREQRLQLSHDLLRILLLQKALGMNLSGPAPPIQYRRLL 115 



>gi|128094|sp|P01297|NEUB_PIG Neuromedin B-32 [Contains: Neuromedin B] 
 gi|69282|pir||BSPGNB neuromedin B-32 - pig 
          Length = 32 

 Score = 52.4 bits (124), Expect = 1e-06 
 Identities = 27/29 (93%), Positives = 27/29 (93%) 

Query: 28  SWDLPEPRSRASKIRVHSRGNLWATGHFM 56 
           SWDLPEPRSRA KIRVH RGNLWATGHFM 
Sbjct: 4   SWDLPEPRSRAGKIRVHPRGNLWATGHFM 32 



>gi|121641|sp|P29007|GRP_BOMOR Gastrin-releasing peptide precursor (GRP-29) [Con 
 Neuromedin C (GRP-10); C-terminal extension peptide (CTEP)] 
 gi|283791|pir||A42437 gastrin-releasing peptide - Bombina orientalis 
 gi|211019|gb|AAA51409.1| gastrin-releasing peptide 
          Length = 155 

 Score = 35.4 bits (80), Expect = 0.18 
 Identities = 14/25 (56%), Positives = 19/25 (76%) 

Query: 38  ASKIRVHSRGNLWATGHFMGKKSLE 62 
           AS  +++ RG+ WA GH MGKKS+E 
Sbjct: 42  ASLSKIYPRGSHWAVGHLMGKKSIE 66 


>gi|1237140|gb|AAC59785.1| SAP bombesin preprohormone 
          Length = 119 

 Score = 34.3 bits (77), Expect = 0.36 
 Identities = 15/33 (45%), Positives = 21/33 (63%) 

Query: 33  EPRSRASKIRVHSRGNLWATGHFMGKKSLEPSS 65 
           +P ++       S GN WA GHFMGKKSL+ ++ 
Sbjct: 35  DPNNQGGLSLQQSLGNQWARGHFMGKKSLQDTN 67 


>gi|9049482|gb|AAF82387.1|AF111028_1 gastrin-releasing peptide precursor [Caras 
          Length = 157 

 Score = 33.9 bits (76), Expect = 0.52 
 Identities = 19/43 (44%), Positives = 23/43 (53%), Gaps = 5/43 (11%) 

Query: 42  RVHSRGNLWATGHFMGKKS----LEPSSPSHWGQLPTPPLRDQ 80 
           +V+ RGN WA GH MGKKS    + P  P   G   +   RDQ 
Sbjct: 35  KVYPRGNHWAVGHLMGKKSTDEQVRPEDPED-GDETSMTTRDQ 76 



>gi|422599|pir||A47201 bombesinlike peptide - African clawed frog 
          Length = 120 

 Score = 33.5 bits (75), Expect = 0.63 
 Identities = 18/33 (54%), Positives = 20/33 (60%), Gaps = 3/33 (9%) 

Query: 30  DLPEPRSRASKIRVHSRGNLWATGHFMGKKSLE 62 
           +  E      KIR   RGN WA GHFMGKKSL+ 
Sbjct: 31  EFSEDARNIEKIR---RGNQWAIGHFMGKKSLQ 60 


>gi|1171692|sp|P43443|NEUB_XENLA Neuromedin B precursor 
 gi|214614|gb|AAA49912.1| neuromedin 
          Length = 120 

 Score = 33.5 bits (75), Expect = 0.69 
 Identities = 18/33 (54%), Positives = 20/33 (60%), Gaps = 3/33 (9%) 

Query: 30  DLPEPRSRASKIRVHSRGNLWATGHFMGKKSLE 62 
           +  E      KIR   RGN WA GHFMGKKSL+ 
Sbjct: 31  EFSEDARNIEKIR---RGNQWAIGHFMGKKSLQ 60 


>gi|1237138|gb|AAC59784.1| Phe-13 bombesin preprohormone 
 gi|1589538|prf||2211320A bombesin 
          Length = 119 

 Score = 33.5 bits (75), Expect = 0.72 
 Identities = 13/16 (81%), Positives = 14/16 (87%) 

Query: 47  GNLWATGHFMGKKSLE 62 
           GN WA GHFMGKKSL+ 
Sbjct: 49  GNQWAVGHFMGKKSLQ 64 



>gi|115097|sp|P01296|BOMB_BOMVA Bombesin precursor 
 gi|25763397|pir||BSTDY bombesin precursor - yellow-bellied toad 
 gi|65309|emb|CAA36686.1| bombesin [Bombina variegata] 
          Length = 107 

 Score = 31.6 bits (70), Expect = 2.3 
 Identities = 12/16 (75%), Positives = 13/16 (81%) 

Query: 47  GNLWATGHFMGKKSLE 62 
           GN WA GH MGKKSL+ 
Sbjct: 46  GNQWAVGHLMGKKSLQ 61 


>gi|115096|sp|P21591|BOMB_BOMOR Bombesin precursor 
 gi|103971|pir||A39261 bombesin precursor - Bombina orientalis 
 gi|211017|gb|AAA48551.1| bombesin 
          Length = 119 

 Score = 31.2 bits (69), Expect = 2.8 
 Identities = 15/31 (48%), Positives = 19/31 (61%), Gaps = 1/31 (3%) 

Query: 33  EPRSRASKIRVHSR-GNLWATGHFMGKKSLE 62 
           E  +   +I +  R GN WA GH MGKKSL+ 
Sbjct: 34  EDPNNQGRISLQQRLGNQWAVGHLMGKKSLQ 64 


>gi|87496|pir||A26182 gastrin-releasing peptide precursor spli 
          Length = 138 

 Score = 30.4 bits (67), Expect = 5.0 
 Identities = 18/32 (56%), Positives = 21/32 (65%), Gaps = 1/32 (3%) 

Query: 42  RVHSRGNLWATGHFMGKKSL-EPSSPSHWGQL 72 
           +++ RGN WA GH MGKKS  E SS S  G L 
Sbjct: 36  KMYPRGNHWAVGHLMGKKSTGESSSVSERGSL 67 



>gi|1346185|sp|P47851|GRP_SHEEP Gastrin-releasing peptide precursor (GRP) [Conta 
 (GRP-10)] 
 gi|2136897|pir||I47010 gastrin-releasing peptide - sheep 
 gi|913168|gb|AAB32675.1| Gastrin-releasing peptide GRP [Ovis aries] 
          Length = 134 

 Score = 30.0 bits (66), Expect = 7.8 
 Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 9/42 (21%) 

Query: 32  PEPRSRASKI---------RVHSRGNLWATGHFMGKKSLEPS 64 
           P PR  A+ +         ++++RGN WA GH MGKKS+  S 
Sbjct: 17  PAPRGSAAPVTAGRAGALAKMYTRGNHWAVGHLMGKKSVAES 58 


>gi|2506229|sp|P08947|LITP_PHYSA [Phe8]-phyllolitorin precursor 
 gi|1362698|pir||B57058 Phe-8 phyllolitorin precursor - Sauvage's leaf frog 
 gi|998450|gb|AAB32788.1| Phe8 phyllolitorin [Phyllomedusa sauvagei] 
          Length = 90 

 Score = 29.6 bits (65), Expect = 8.6 
 Identities = 17/33 (51%), Positives = 19/33 (57%), Gaps = 1/33 (3%) 

Query: 30  DLPEPRSRASKIRVHSRGNLWATGHFMGKKSLE 62 
           ++ E     SK  V  R  LWA G FMGKKSLE 
Sbjct: 32  EVTEESDDLSKRNVLQR-QLWAVGSFMGKKSLE 63 






Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

Posted date: Apr 21, 2003 1:41 AM 
Number of letters in database: 455,667,871 
Number of sequences in database: 1,415,660 

Lambda K H 

0.319 0.134 0.424 

Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 89,060,364 

Number of Sequences: 1415660 

Number of extensions: 2728065 

Number of successful extensions: 4328 

Number of sequences better than 10.0: 24 

Number of HSP's better than 10.0 without gapping: 23 

Number of HSP's successfully gapped in prelim test: 1 

Number of HSP's that attempted gapping in prelim test: 4305 

Number of HSP's gapped (non-prelim): 24 

length of query: 121 

length of database: 455,667,871 

effective HSP length: 97 

effective length of query: 24 

effective length of database: 318,348,851 

effective search space: 7640372424 

effective search space used: 7640372424 

T: 11 

A: 40 

X1: 16 ( 7.4 bits) 
X2: 38 (14.6 bits) 
X3: 64 (24.7 bits) 
S1: 41 (21.8 bits) 
S2: 65 (29.6 bits) 
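
The length and search-space figures in this summary can be reproduced from the other numbers printed in it. The following is a minimal Python sketch, assuming that the effective lengths are obtained by subtracting the effective HSP length from the query and from each database sequence, and using the standard raw-to-bit conversion S' = (lambda * S - ln K) / ln 2. Pairing S2 with the gapped parameters and S1 with the ungapped parameters is an assumption of this sketch; it happens to reproduce the bit values printed above.

```python
import math

# Values transcribed from the search summary above.
QUERY_LEN = 121
DB_LETTERS = 455_667_871
DB_SEQUENCES = 1_415_660
EFFECTIVE_HSP_LEN = 97

# Effective lengths: trim one effective HSP length from the query and from
# every database sequence, then multiply to get the search space.
eff_query = QUERY_LEN - EFFECTIVE_HSP_LEN
eff_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LEN
search_space = eff_query * eff_db


def bit_score(raw: int, lam: float, k: float) -> float:
    """Convert a raw score to bits: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


print(eff_query, eff_db, search_space)
print(round(bit_score(65, 0.267, 0.0410), 1))  # S2 with the gapped Lambda and K
print(round(bit_score(41, 0.319, 0.134), 1))   # S1 with the ungapped Lambda and K
```

With an effective query length of only 24 residues, short conserved motifs such as the bombesin-family C-terminal region can still reach E-values near 1, which is why the lower part of the hit list consists of weak, motif-only matches.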